Related papers: Amodal Instance Segmentation with Diffusion Shape …

ShapeFormer: Shape Prior Visible-to-Amodal Transformer-based Amodal Instance Segmentation

Amodal Instance Segmentation (AIS) presents a challenging task as it involves predicting both visible and occluded parts of objects within images. Existing AIS methods rely on a bidirectional approach, encompassing both the transition from…

Computer Vision and Pattern Recognition · Computer Science · 2024-04-18 · Minh Tran, Winston Bounsavy, Khoa Vo, Anh Nguyen, Tri Nguyen, Ngan Le

AISFormer: Amodal Instance Segmentation with Transformer

Amodal Instance Segmentation (AIS) aims to segment the region of both visible and possible occluded parts of an object instance. While Mask R-CNN-based AIS approaches have shown promising results, they are unable to model high-level…

Computer Vision and Pattern Recognition · Computer Science · 2024-03-19 · Minh Tran, Khoa Vo, Kashu Yamazaki, Arthur Fernandes, Michael Kidd, Ngan Le

Amodal Segmentation Based on Visible Region Segmentation and Shape Prior

Almost all existing amodal segmentation methods infer occluded regions by using features corresponding to the whole image. This runs counter to human amodal perception, in which humans use the visible part and the shape prior…

Computer Vision and Pattern Recognition · Computer Science · 2020-12-22 · Yuting Xiao, Yanyu Xu, Ziming Zhong, Weixin Luo, Jiawei Li, Shenghua Gao

Using Diffusion Priors for Video Amodal Segmentation

Object permanence in humans is a fundamental cue that helps in understanding persistence of objects, even when they are fully occluded in the scene. Present day methods in object segmentation do not account for this amodal nature of the…

Computer Vision and Pattern Recognition · Computer Science · 2024-12-09 · Kaihua Chen, Deva Ramanan, Tarasha Khurana

Sequential Amodal Segmentation via Cumulative Occlusion Learning

To fully understand the 3D context of a single image, a visual system must be able to segment both the visible and occluded regions of objects, while discerning their occlusion order. Ideally, the system should be able to handle any object…

Computer Vision and Pattern Recognition · Computer Science · 2024-05-10 · Jiayang Ao, Qiuhong Ke, Krista A. Ehinger

Amodal Segmentation for Laparoscopic Surgery Video Instruments

Segmentation of surgical instruments is crucial for enhancing surgeon performance and ensuring patient safety. Conventional techniques such as binary, semantic, and instance segmentation share a common drawback: they do not accommodate the…

Computer Vision and Pattern Recognition · Computer Science · 2024-08-05 · Ruohua Shi, Zhaochen Liu, Lingyu Duan, Tingting Jiang

DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model

Constructing high-definition (HD) maps is a crucial requirement for enabling autonomous driving. In recent years, several map segmentation algorithms have been developed to address this need, leveraging advancements in Bird's-Eye View (BEV)…

Computer Vision and Pattern Recognition · Computer Science · 2024-09-04 · Peijin Jia, Tuopu Wen, Ziang Luo, Mengmeng Yang, Kun Jiang, Zhiquan Lei, Xuewei Tang, Ziyuan Liu, Le Cui, Bo Zhang, Long Huang, Diange Yang

ProGiDiff: Prompt-Guided Diffusion-Based Medical Image Segmentation

Widely adopted medical image segmentation methods, although efficient, are primarily deterministic and remain poorly amenable to natural language prompts. Thus, they lack the capability to estimate multiple proposals, human interaction, and…

Computer Vision and Pattern Recognition · Computer Science · 2026-01-23 · Yuan Lin, Murong Xu, Marc Hölle, Chinmay Prabhakar, Andreas Maier, Vasileios Belagiannis, Bjoern Menze, Suprosanna Shit

CamDiff: Camouflage Image Augmentation via Diffusion Model

The burgeoning field of camouflaged object detection (COD) seeks to identify objects that blend into their surroundings. Despite the impressive performance of recent models, we have identified a limitation in their robustness, where…

Computer Vision and Pattern Recognition · Computer Science · 2023-04-13 · Xue-Jing Luo, Shuo Wang, Zongwei Wu, Christos Sakaridis, Yun Cheng, Deng-Ping Fan, Luc Van Gool

CamoDiffusion: Camouflaged Object Detection via Conditional Diffusion Models

Camouflaged Object Detection (COD) is a challenging task in computer vision due to the high similarity between camouflaged objects and their surroundings. Existing COD methods primarily employ semantic segmentation, which suffers from…

Computer Vision and Pattern Recognition · Computer Science · 2023-05-30 · Zhongxi Chen, Ke Sun, Xianming Lin, Rongrong Ji

Medical Semantic Segmentation with Diffusion Pretrain

Recent advances in deep learning have shown that learning robust feature representations is critical for the success of many computer vision tasks, including medical image segmentation. In particular, both transformer and…

Computer Vision and Pattern Recognition · Computer Science · 2025-02-03 · David Li, Anvar Kurmukov, Mikhail Goncharov, Roman Sokolov, Mikhail Belyaev

Perceiving the Invisible: Proposal-Free Amodal Panoptic Segmentation

Amodal panoptic segmentation aims to connect the perception of the world to its cognitive understanding. It entails simultaneously predicting the semantic labels of visible scene regions and the entire shape of traffic participant…

Computer Vision and Pattern Recognition · Computer Science · 2022-05-31 · Rohit Mohan, Abhinav Valada

A2VIS: Amodal-Aware Approach to Video Instance Segmentation

Handling occlusion remains a significant challenge for video instance-level tasks like Multiple Object Tracking (MOT) and Video Instance Segmentation (VIS). In this paper, we propose a novel framework, Amodal-Aware Video Instance…

Computer Vision and Pattern Recognition · Computer Science · 2025-04-11 · Minh Tran, Thang Pham, Winston Bounsavy, Tri Nguyen, Ngan Le

High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity

In the realm of high-resolution (HR), fine-grained image segmentation, the primary challenge is balancing broad contextual awareness with the precision required for detailed object delineation, capturing intricate details and the finest…

Computer Vision and Pattern Recognition · Computer Science · 2025-07-01 · Qian Yu, Peng-Tao Jiang, Hao Zhang, Jinwei Chen, Bo Li, Lihe Zhang, Huchuan Lu

InpDiffusion: Image Inpainting Localization via Conditional Diffusion Models

As artificial intelligence advances rapidly, particularly with the advent of GANs and diffusion models, the accuracy of Image Inpainting Localization (IIL) has become increasingly challenging. Current IIL methods face two main challenges: a…

Computer Vision and Pattern Recognition · Computer Science · 2025-01-07 · Kai Wang, Shaozhang Niu, Qixian Hao, Jiwei Zhang

Diffusion Model for Camouflaged Object Detection

Camouflaged object detection is a challenging task that aims to identify objects that are highly similar to their background. Due to the powerful noise-to-image denoising capability of denoising diffusion models, in this paper, we propose a…

Computer Vision and Pattern Recognition · Computer Science · 2023-08-08 · Zhennan Chen, Rongrong Gao, Tian-Zhu Xiang, Fan Lin

Denoising Diffusion Semantic Segmentation with Mask Prior Modeling

The evolution of semantic segmentation has long been dominated by learning more discriminative image representations for classifying each pixel. Despite the prominent advancements, the priors of segmentation masks themselves, e.g.,…

Computer Vision and Pattern Recognition · Computer Science · 2023-06-23 · Zeqiang Lai, Yuchen Duan, Jifeng Dai, Ziheng Li, Ying Fu, Hongsheng Li, Yu Qiao, Wenhai Wang

MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation

Semantic segmentation is essential in computer vision for various applications, yet traditional approaches face significant challenges, including the high cost of annotation and extensive training for supervised learning. Additionally, due…

Computer Vision and Pattern Recognition · Computer Science · 2024-03-19 · Yasufumi Kawano, Yoshimitsu Aoki

LEAF: Latent Diffusion with Efficient Encoder Distillation for Aligned Features in Medical Image Segmentation

Leveraging the powerful capabilities of diffusion models has yielded quite effective results in medical image segmentation tasks. However, existing methods typically transfer the original training process directly without specific…

Computer Vision and Pattern Recognition · Computer Science · 2025-07-25 · Qilin Huang, Tianyu Lin, Zhiguang Chen, Fudan Zheng

Implicit and Explicit Language Guidance for Diffusion-based Visual Perception

Text-to-image diffusion models have shown powerful ability on conditional image synthesis. With large-scale vision-language pre-training, diffusion models are able to generate high-quality images with rich texture and reasonable structure…

Computer Vision and Pattern Recognition · Computer Science · 2024-08-16 · Hefeng Wang, Jiale Cao, Jin Xie, Aiping Yang, Yanwei Pang