Related papers: INT: Instance-Specific Negative Mining for Task-Ge…

Unleashing the Power of Prompt-driven Nucleus Instance Segmentation

Nucleus instance segmentation in histology images is crucial for a broad spectrum of clinical applications. Current dominant algorithms rely on regression of nuclear proxy maps. Distinguishing nucleus instances from the estimated maps…

Computer Vision and Pattern Recognition · Computer Science 2024-01-25 Zhongyi Shui, Yunlong Zhang, Kai Yao, Chenglu Zhu, Sunyi Zheng, Jingxiong Li, Honglin Li, Yuxuan Sun, Ruizhe Guo, Lin Yang

Visual Prompt Selection for In-Context Learning Segmentation

As a fundamental and extensively studied task in computer vision, image segmentation aims to locate and identify different semantic concepts at the pixel level. Recently, inspired by In-Context Learning (ICL), several generalist…

Computer Vision and Pattern Recognition · Computer Science 2024-07-16 Wei Suo, Lanqing Lai, Mengyang Sun, Hanwang Zhang, Peng Wang, Yanning Zhang

Insight Any Instance: Promptable Instance Segmentation for Remote Sensing Images

Instance segmentation of remote sensing images (RSIs) is an essential task for a wide range of applications such as land planning and intelligent transport. Instance segmentation of RSIs is constantly plagued by the unbalanced ratio of…

Computer Vision and Pattern Recognition · Computer Science 2024-09-12 Xuexue Li

Instance-wise Prompt Tuning for Pretrained Language Models

Prompt Learning has recently gained great popularity in bridging the gap between pretraining tasks and various downstream tasks. It freezes Pretrained Language Models (PLMs) and only tunes a few task-related parameters (prompts) for…

Computation and Language · Computer Science 2022-06-07 Yuezihan Jiang, Hao Yang, Junyang Lin, Hanyu Zhao, An Yang, Chang Zhou, Hongxia Yang, Zhi Yang, Bin Cui

Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation

Promptable segmentation typically requires instance-specific manual prompts to guide the segmentation of each desired object. To minimize such a need, task-generic promptable segmentation has been introduced, which employs a single…

Computer Vision and Pattern Recognition · Computer Science 2026-03-10 Jian Hu, Jiayi Lin, Junchi Yan, Shaogang Gong

Task-Specific Adaptation of Segmentation Foundation Model via Prompt Learning

Recently, foundation models trained on massive datasets to adapt to a wide range of tasks have attracted considerable attention and are actively being explored within the computer vision community. Among these, the Segment Anything Model…

Computer Vision and Pattern Recognition · Computer Science 2024-10-14 Hyung-Il Kim, Kimin Yun, Jun-Seok Yun, Yuseok Bae

MINT: Memory-Infused Prompt Tuning at Test-time for CLIP

Improving the generalization ability of Vision-Language Pre-trained Models (VLMs) under test-time data distribution shifts remains a critical challenge. The existing Test-Time Adaptation (TTA) methods fall short in fully leveraging the…

Computer Vision and Pattern Recognition · Computer Science 2025-06-05 Jiaming Yi, Ruirui Pan, Jishen Yang, Xiulong Yang

Semantic Prompting with Image-Token for Continual Learning

Continual learning aims to refine model parameters for new tasks while retaining knowledge from previous tasks. Recently, prompt-based learning has emerged to leverage pre-trained models to be prompted to learn subsequent tasks without the…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Jisu Han, Jaemin Na, Wonjun Hwang

Improving Video Instance Segmentation by Light-weight Temporal Uncertainty Estimates

Instance segmentation with neural networks is an essential task in environment perception. In many works, it has been observed that neural networks can predict false positive instances with high confidence values and true positives with low…

Computer Vision and Pattern Recognition · Computer Science 2026-01-13 Kira Maag, Matthias Rottmann, Serin Varghese, Fabian Hueger, Peter Schlicht, Hanno Gottschalk

Curriculum Prompting Foundation Models for Medical Image Segmentation

Adapting large pre-trained foundation models, e.g., SAM, for medical image segmentation remains a significant challenge. A crucial step involves the formulation of a series of specialized prompts that incorporate specific clinical…

Computer Vision and Pattern Recognition · Computer Science 2024-09-04 Xiuqi Zheng, Yuhang Zhang, Haoran Zhang, Hongrui Liang, Xueqi Bao, Zhuqing Jiang, Qicheng Lao

Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts

The goal of interactive image segmentation is to delineate specific regions within an image via visual or language prompts. Low-latency and high-quality interactive segmentation with diverse prompts remain challenging for existing…

Computer Vision and Pattern Recognition · Computer Science 2024-04-02 Qin Liu, Jaemin Cho, Mohit Bansal, Marc Niethammer

INST-IT: Boosting Instance Understanding via Explicit Visual Prompt Instruction Tuning

Large Multimodal Models (LMMs) have made significant breakthroughs with the advancement of instruction tuning. However, while existing models can understand images and videos at a holistic level, they still struggle with instance-level…

Computer Vision and Pattern Recognition · Computer Science 2026-03-17 Wujian Peng, Lingchen Meng, Yitong Chen, Yiweng Xie, Yang Liu, Tao Gui, Hang Xu, Xipeng Qiu, Zuxuan Wu, Yu-Gang Jiang

Towards PerSense++: Advancing Training-Free Personalized Instance Segmentation in Dense Images

Segmentation in dense visual scenes poses significant challenges due to occlusions, background clutter, and scale variations. To address this, we introduce PerSense, an end-to-end, training-free, and model-agnostic one-shot framework for…

Computer Vision and Pattern Recognition · Computer Science 2025-08-21 Muhammad Ibraheem Siddiqui, Muhammad Umer Sheikh, Hassan Abid, Kevin Henry, Muhammad Haris Khan

ProMerge: Prompt and Merge for Unsupervised Instance Segmentation

Unsupervised instance segmentation aims to segment distinct object instances in an image without relying on human-labeled data. This field has recently seen significant advancements, partly due to the strong local correspondences afforded…

Computer Vision and Pattern Recognition · Computer Science 2024-09-30 Dylan Li, Gyungin Shin

Show or Tell? Effectively prompting Vision-Language Models for semantic segmentation

Large Vision-Language Models (VLMs) are increasingly being regarded as foundation models that can be instructed to solve diverse tasks by prompting, without task-specific training. We examine the seemingly obvious question: how to…

Computer Vision and Pattern Recognition · Computer Science 2026-02-09 Niccolo Avogaro, Thomas Frick, Mattia Rigotti, Andrea Bartezzaghi, Filip Janicki, Cristiano Malossi, Konrad Schindler, Roy Assaf

Progressive Multi-modal Conditional Prompt Tuning

Pre-trained vision-language models (VLMs) have shown remarkable generalization capabilities via prompting, which leverages VLMs as knowledge bases to extract information beneficial for downstream tasks. However, existing methods primarily…

Computer Vision and Pattern Recognition · Computer Science 2024-04-25 Xiaoyu Qiu, Hao Feng, Yuechen Wang, Wengang Zhou, Houqiang Li

PerSense: Training-Free Personalized Instance Segmentation in Dense Images

The emergence of foundational models has significantly advanced segmentation approaches. However, challenges still remain in dense scenarios, where occlusions, scale variations, and clutter impede precise instance delineation. To address…

Computer Vision and Pattern Recognition · Computer Science 2025-08-08 Muhammad Ibraheem Siddiqui, Muhammad Umer Sheikh, Hassan Abid, Muhammad Haris Khan

Prompt-Driven Image Analysis with Multimodal Generative AI: Detection, Segmentation, Inpainting, and Interpretation

Prompt-driven image analysis converts a single natural-language instruction into multiple steps: locate, segment, edit, and describe. We present a practical case study of a unified pipeline that combines open-vocabulary detection,…

Computer Vision and Pattern Recognition · Computer Science 2025-09-11 Kaleem Ahmad

Instance-aware Prompt Learning for Language Understanding and Generation

Recently, prompt learning has become a new paradigm to utilize pre-trained language models (PLMs) and achieves promising results in downstream tasks with a negligible increase of parameters. The current usage of discrete and continuous…

Computation and Language · Computer Science 2022-01-19 Feihu Jin, Jinliang Lu, Jiajun Zhang, Chengqing Zong

Towards Instance-level Image-to-Image Translation

Unpaired Image-to-image Translation is a new rising and challenging vision problem that aims to learn a mapping between unaligned image pairs in diverse domains. Recent advances in this field like MUNIT and DRIT mainly focus on…

Computer Vision and Pattern Recognition · Computer Science 2019-05-07 Zhiqiang Shen, Mingyang Huang, Jianping Shi, Xiangyang Xue, Thomas Huang