Related papers: SimpleClick: Interactive Image Segmentation with S…

PseudoClick: Interactive Image Segmentation with Click Imitation

The goal of click-based interactive image segmentation is to obtain precise object segmentation masks with limited user interaction, i.e., by a minimal number of user clicks. Existing methods require users to provide all the clicks: by…

Computer Vision and Pattern Recognition · Computer Science · 2022-07-28 · Qin Liu, Meng Zheng, Benjamin Planche, Srikrishna Karanam, Terrence Chen, Marc Niethammer, Ziyan Wu

Exploring Plain Vision Transformer Backbones for Object Detection

We explore the plain, non-hierarchical Vision Transformer (ViT) as a backbone network for object detection. This design enables the original ViT architecture to be fine-tuned for object detection without needing to redesign a hierarchical…

Computer Vision and Pattern Recognition · Computer Science · 2022-06-13 · Yanghao Li, Hanzi Mao, Ross Girshick, Kaiming He

ClickSeg3D: Few-Click Interactive Segmentation via Semantic Embeddings

Interactive segmentation allows efficient label generation by leveraging user-provided clicks to progressively refine predictions, which is critical when fully supervised labels are costly or generalization to unseen classes is needed.…

Computer Vision and Pattern Recognition · Computer Science · 2026-05-19 · Xueyang Kang, Zijian Yu, Kourosh Khoshelham, Liangliang Nan

Interactive Image Segmentation with Cross-Modality Vision Transformers

Interactive image segmentation aims to segment a target from the background under manual guidance, taking as input multimodal data such as images, clicks, scribbles, and bounding boxes. Recently, vision transformers have achieved…

Computer Vision and Pattern Recognition · Computer Science · 2023-07-06 · Kun Li, George Vosselman, Michael Ying Yang

Learning from Exemplars for Interactive Image Segmentation

Interactive image segmentation enables users to interact minimally with a machine, facilitating the gradual refinement of the segmentation mask for a target of interest. Previous studies have demonstrated impressive performance in…

Computer Vision and Pattern Recognition · Computer Science · 2024-06-18 · Kun Li, Hao Cheng, George Vosselman, Michael Ying Yang

PiClick: Picking the desired mask from multiple candidates in click-based interactive segmentation

Click-based interactive segmentation aims to generate target masks via human clicking, which facilitates efficient pixel-level annotation and image editing. In such a task, target ambiguity remains a problem hindering the accuracy and…

Computer Vision and Pattern Recognition · Computer Science · 2024-06-18 · Cilin Yan, Haochen Wang, Jie Liu, Xiaolong Jiang, Yao Hu, Xu Tang, Guoliang Kang, Efstratios Gavves

SegViT: Semantic Segmentation with Plain Vision Transformers

We explore the capability of plain Vision Transformers (ViTs) for semantic segmentation and propose SegViT. Previous ViT-based segmentation networks usually learn a pixel-level representation from the output of the ViT. Differently, we…

Computer Vision and Pattern Recognition · Computer Science · 2022-12-13 · Bowen Zhang, Zhi Tian, Quan Tang, Xiangxiang Chu, Xiaolin Wei, Chunhua Shen, Yifan Liu

FocalClick: Towards Practical Interactive Image Segmentation

Interactive segmentation allows users to extract target masks by making positive/negative clicks. Although explored by many previous works, there is still a gap between academic approaches and industrial needs: first, existing models are…

Computer Vision and Pattern Recognition · Computer Science · 2022-04-19 · Xi Chen, Zhiyan Zhao, Yilei Zhang, Manni Duan, Donglian Qi, Hengshuang Zhao

Native Segmentation Vision Transformers

Uniform downsampling remains the de facto standard for reducing spatial resolution in vision backbones. In this work, we propose an alternative design built around a content-aware spatial grouping layer that dynamically assigns tokens to a…

Computer Vision and Pattern Recognition · Computer Science · 2025-05-23 · Guillem Brasó, Aljoša Ošep, Laura Leal-Taixé

Interactive Object Segmentation with Dynamic Click Transform

In interactive segmentation, users initially click on the target object to segment the main body and then provide corrections on mislabeled regions to iteratively refine the segmentation masks. Most existing methods transform these…

Computer Vision and Pattern Recognition · Computer Science · 2021-06-22 · Chun-Tse Lin, Wei-Chih Tu, Chih-Ting Liu, Shao-Yi Chien

Reviving Iterative Training with Mask Guidance for Interactive Segmentation

Recent works on click-based interactive segmentation have demonstrated state-of-the-art results by using various inference-time optimization schemes. These methods are considerably more computationally expensive than feedforward…

Computer Vision and Pattern Recognition · Computer Science · 2021-02-15 · Konstantin Sofiiuk, Ilia A. Petrov, Anton Konushin

FocalClick-XL: Towards Unified and High-quality Interactive Segmentation

Interactive segmentation enables users to extract binary masks of target objects through simple interactions such as clicks, scribbles, and boxes. However, existing methods often support only limited interaction forms and struggle to…

Computer Vision and Pattern Recognition · Computer Science · 2025-06-18 · Xi Chen, Hengshuang Zhao

MIDeepSeg: Minimally Interactive Segmentation of Unseen Objects from Medical Images Using Deep Learning

Segmentation of organs or lesions from medical images plays an essential role in many clinical applications such as diagnosis and treatment planning. Though Convolutional Neural Networks (CNNs) have achieved state-of-the-art performance…

Computer Vision and Pattern Recognition · Computer Science · 2021-05-27 · Xiangde Luo, Guotai Wang, Tao Song, Jingyang Zhang, Michael Aertsen, Jan Deprest, Sebastien Ourselin, Tom Vercauteren, Shaoting Zhang

Robust Interactive Semantic Segmentation of Pathology Images with Minimal User Input

From the simple measurement of tissue attributes in pathology workflow to designing an explainable diagnostic/prognostic AI tool, access to accurate semantic segmentation of tissue regions in histology images is a prerequisite. However,…

Image and Video Processing · Electrical Eng. & Systems · 2021-08-31 · Mostafa Jahanifar, Neda Zamani Tajeddin, Navid Alemi Koohbanani, Nasir Rajpoot

Deep learning-based interactive segmentation in remote sensing

Interactive segmentation, a computer vision technique where a user provides guidance to help an algorithm segment a feature of interest in an image, has achieved outstanding accuracy and efficient human-computer interaction. However, few…

Computer Vision and Pattern Recognition · Computer Science · 2025-05-14 · Zhe Wang, Shoukun Sun, Xiang Que, Xiaogang Ma, Carmen Galaz Garcia

SafeClick: Error-Tolerant Interactive Segmentation of Any Medical Volumes via Hierarchical Expert Consensus

Foundation models for volumetric medical image segmentation have emerged as powerful tools in clinical workflows, enabling radiologists to delineate regions of interest through intuitive clicks. While these models demonstrate promising…

Image and Video Processing · Electrical Eng. & Systems · 2025-06-24 · Yifan Gao, Jiaxi Sheng, Wenbin Wu, Haoyue Li, Yaoxian Dong, Chaoyang Ge, Feng Yuan, Xin Gao

Easy3D: A Simple Yet Effective Method for 3D Interactive Segmentation

The increasing availability of digital 3D environments, whether through image-based 3D reconstruction, generation, or scans obtained by robots, is driving innovation across various applications. These come with a significant demand for 3D…

Computer Vision and Pattern Recognition · Computer Science · 2025-04-16 · Andrea Simonelli, Norman Müller, Peter Kontschieder

Minimalist and High-Performance Semantic Segmentation with Plain Vision Transformers

In the wake of Masked Image Modeling (MIM), a diverse range of plain, non-hierarchical Vision Transformer (ViT) models have been pre-trained with extensive datasets, offering new paradigms and significant potential for semantic…

Computer Vision and Pattern Recognition · Computer Science · 2023-10-20 · Yuanduo Hong, Jue Wang, Weichao Sun, Huihui Pan

SPT: Sequence Prompt Transformer for Interactive Image Segmentation

Interactive segmentation aims to extract objects of interest from an image based on user-provided clicks. In real-world applications, there is often a need to segment a series of images featuring the same target object. However, existing…

Computer Vision and Pattern Recognition · Computer Science · 2024-12-16 · Senlin Cheng, Haopeng Sun

MFP: Making Full Use of Probability Maps for Interactive Image Segmentation

In recent interactive segmentation algorithms, previous probability maps are used as network input to help predictions in the current segmentation round. However, despite the utilization of previous masks, useful information contained in…

Computer Vision and Pattern Recognition · Computer Science · 2024-05-13 · Chaewon Lee, Seon-Ho Lee, Chang-Su Kim