Related papers: Repurposing SAM for User-Defined Semantics Aware S…

Segment Anything Model (SAM) Enhanced Pseudo Labels for Weakly Supervised Semantic Segmentation

Weakly supervised semantic segmentation (WSSS) aims to bypass the need for laborious pixel-level annotation by using only image-level annotation. Most existing methods rely on Class Activation Maps (CAM) to derive pixel-level pseudo-labels…

Computer Vision and Pattern Recognition · Computer Science 2023-11-07 Tianle Chen , Zheda Mai , Ruiwen Li , Wei-lun Chao

Do Instance Priors Help Weakly Supervised Semantic Segmentation?

Semantic segmentation requires dense pixel-level annotations, which are costly and time-consuming to acquire. To address this, we present SeSAM, a framework that uses a foundational segmentation model, i.e. Segment Anything Model (SAM),…

Computer Vision and Pattern Recognition · Computer Science 2026-04-14 Anurag Das , Anna Kukleva , Xinting Hu , Yuki M. Asano , Bernt Schiele

Universal Organizer of SAM for Unsupervised Semantic Segmentation

Unsupervised semantic segmentation (USS) aims to achieve high-quality segmentation without manual pixel-level annotations. Existing USS models provide coarse category classification for regions, but the results often have blurry and…

Multimedia · Computer Science 2024-05-21 Tingting Li , Gensheng Pei , Xinhao Cai , Huafeng Liu , Qiong Wang , Yazhou Yao

Semantic-SAM: Segment and Recognize Anything at Any Granularity

In this paper, we introduce Semantic-SAM, a universal image segmentation model to enable segment and recognize anything at any desired granularity. Our model offers two key advantages: semantic-awareness and granularity-abundance. To…

Computer Vision and Pattern Recognition · Computer Science 2023-07-11 Feng Li , Hao Zhang , Peize Sun , Xueyan Zou , Shilong Liu , Jianwei Yang , Chunyuan Li , Lei Zhang , Jianfeng Gao

UnSAMv2: Self-Supervised Learning Enables Segment Anything at Any Granularity

The Segment Anything Model (SAM) family has become a widely adopted vision foundation model, but its ability to control segmentation granularity remains limited. Users often need to refine results manually - by adding more prompts or…

Computer Vision and Pattern Recognition · Computer Science 2025-11-18 Junwei Yu , Trevor Darrell , XuDong Wang

AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model

Segment Anything Model (SAM) has gained significant recognition in the field of semantic segmentation due to its versatile capabilities and impressive performance. Despite its success, SAM faces two primary limitations: (1) it relies…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Yuchen Li , Li Zhang , Youwei Liang , Pengtao Xie

Propagating Semantic Labels in Video Data

Semantic Segmentation combines two sub-tasks: the identification of pixel-level image masks and the application of semantic labels to those masks. Recently, so-called Foundation Models have been introduced; general models trained on very…

Computer Vision and Pattern Recognition · Computer Science 2023-10-03 David Balaban , Justin Medich , Pranay Gosar , Justin Hart

SSP-SAM: SAM with Semantic-Spatial Prompt for Referring Expression Segmentation

The Segment Anything Model (SAM) excels at general image segmentation but has limited ability to understand natural language, which restricts its direct application in Referring Expression Segmentation (RES). Toward this end, we propose…

Computer Vision and Pattern Recognition · Computer Science 2026-03-20 Wei Tang , Xuejing Liu , Yanpeng Sun , Zechao Li

ConformalSAM: Unlocking the Potential of Foundational Segmentation Models in Semi-Supervised Semantic Segmentation with Conformal Prediction

Pixel-level vision tasks, such as semantic segmentation, require extensive and high-quality annotated data, which is costly to obtain. Semi-supervised semantic segmentation (SSSS) has emerged as a solution to alleviate the labeling burden…

Computer Vision and Pattern Recognition · Computer Science 2025-07-22 Danhui Chen , Ziquan Liu , Chuxi Yang , Dan Wang , Yan Yan , Yi Xu , Xiangyang Ji

Semantic-aware SAM for Point-Prompted Instance Segmentation

Single-point annotation in visual tasks, with the goal of minimizing labelling costs, is becoming increasingly prominent in research. Recently, visual foundation models, such as Segment Anything (SAM), have gained widespread usage due to…

Computer Vision and Pattern Recognition · Computer Science 2024-05-28 Zhaoyang Wei , Pengfei Chen , Xuehui Yu , Guorong Li , Jianbin Jiao , Zhenjun Han

AV-SAM: Segment Anything Model Meets Audio-Visual Localization and Segmentation

Segment Anything Model (SAM) has recently shown its powerful effectiveness in visual segmentation tasks. However, there is less exploration concerning how SAM works on audio-visual tasks, such as visual sound localization and segmentation.…

Computer Vision and Pattern Recognition · Computer Science 2023-05-04 Shentong Mo , Yapeng Tian

SamDSK: Combining Segment Anything Model with Domain-Specific Knowledge for Semi-Supervised Learning in Medical Image Segmentation

The Segment Anything Model (SAM) exhibits a capability to segment a wide array of objects in natural images, serving as a versatile perceptual tool for various downstream image segmentation tasks. In contrast, medical image segmentation…

Computer Vision and Pattern Recognition · Computer Science 2023-08-29 Yizhe Zhang , Tao Zhou , Shuo Wang , Ye Wu , Pengfei Gu , Danny Z. Chen

Can Unsupervised Segmentation Reduce Annotation Costs for Video Semantic Segmentation?

Present-day deep neural networks for video semantic segmentation require a large number of fine-grained pixel-level annotations to achieve the best possible results. Obtaining such annotations, however, is very expensive. On the other hand,…

Computer Vision and Pattern Recognition · Computer Science 2026-03-31 Samik Some , Vinay P. Namboodiri

Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset

With the breakthrough of large models, Segment Anything Model (SAM) and its extensions have been attempted to apply in diverse tasks of computer vision. Underwater salient instance segmentation is a foundational and vital step for various…

Computer Vision and Pattern Recognition · Computer Science 2025-05-28 Shijie Lian , Ziyi Zhang , Hua Li , Wenjie Li , Laurence Tianruo Yang , Sam Kwong , Runmin Cong

SEG-SAM: Semantic-Guided SAM for Unified Medical Image Segmentation

Recently, developing unified medical image segmentation models gains increasing attention, especially with the advent of the Segment Anything Model (SAM). SAM has shown promising binary segmentation performance in natural domains, however,…

Computer Vision and Pattern Recognition · Computer Science 2024-12-18 Shuangping Huang , Hao Liang , Qingfeng Wang , Chulong Zhong , Zijian Zhou , Miaojing Shi

Char-SAM: Turning Segment Anything Model into Scene Text Segmentation Annotator with Character-level Visual Prompts

The recent emergence of the Segment Anything Model (SAM) enables various domain-specific segmentation tasks to be tackled cost-effectively by using bounding boxes as prompts. However, in scene text segmentation, SAM can not achieve…

Computer Vision and Pattern Recognition · Computer Science 2024-12-31 Enze Xie , Jiaho Lyu , Daiqing Wu , Huawen Shen , Yu Zhou

Input Augmentation with SAM: Boosting Medical Image Segmentation with Segmentation Foundation Model

The Segment Anything Model (SAM) is a recently developed large model for general-purpose segmentation for computer vision tasks. SAM was trained using 11 million images with over 1 billion masks and can produce segmentation results for a…

Computer Vision and Pattern Recognition · Computer Science 2023-06-22 Yizhe Zhang , Tao Zhou , Shuo Wang , Peixian Liang , Danny Z. Chen

There is no SAMantics! Exploring SAM as a Backbone for Visual Understanding Tasks

The Segment Anything Model (SAM) was originally designed for label-agnostic mask generation. Does this model also possess inherent semantic understanding, of value to broader visual tasks? In this work we follow a multi-staged approach…

Computer Vision and Pattern Recognition · Computer Science 2024-11-26 Miguel Espinosa , Chenhongyi Yang , Linus Ericsson , Steven McDonagh , Elliot J. Crowley

Segment Anything Meets Semantic Communication

In light of the diminishing returns of traditional methods for enhancing transmission rates, the domain of semantic communication presents promising new frontiers. Focusing on image transmission, this paper explores the application of…

Computer Vision and Pattern Recognition · Computer Science 2023-06-06 Shehbaz Tariq , Brian Estadimas Arfeto , Chaoning Zhang , Hyundong Shin

SimSAM: Zero-shot Medical Image Segmentation via Simulated Interaction

The recently released Segment Anything Model (SAM) has shown powerful zero-shot segmentation capabilities through a semi-automatic annotation setup in which the user can provide a prompt in the form of clicks or bounding boxes. There is…

Computer Vision and Pattern Recognition · Computer Science 2024-06-04 Benjamin Towle , Xin Chen , Ke Zhou