Related papers: Semantic-Fast-SAM: Efficient Semantic Segmenter

SSP-SAM: SAM with Semantic-Spatial Prompt for Referring Expression Segmentation

The Segment Anything Model (SAM) excels at general image segmentation but has limited ability to understand natural language, which restricts its direct application in Referring Expression Segmentation (RES). Toward this end, we propose…

Computer Vision and Pattern Recognition · Computer Science 2026-03-20 Wei Tang , Xuejing Liu , Yanpeng Sun , Zechao Li

Propagating Semantic Labels in Video Data

Semantic Segmentation combines two sub-tasks: the identification of pixel-level image masks and the application of semantic labels to those masks. Recently, so-called Foundation Models have been introduced; general models trained on very…

Computer Vision and Pattern Recognition · Computer Science 2023-10-03 David Balaban , Justin Medich , Pranay Gosar , Justin Hart

Fast Segment Anything

The recently proposed segment anything model (SAM) has made a significant influence in many computer vision tasks. It is becoming a foundation step for many high-level tasks, like image segmentation, image caption, and image editing.…

Computer Vision and Pattern Recognition · Computer Science 2023-06-22 Xu Zhao , Wenchao Ding , Yongqi An , Yinglong Du , Tao Yu , Min Li , Ming Tang , Jinqiao Wang

Segment Anything Meets Semantic Communication

In light of the diminishing returns of traditional methods for enhancing transmission rates, the domain of semantic communication presents promising new frontiers. Focusing on image transmission, this paper explores the application of…

Computer Vision and Pattern Recognition · Computer Science 2023-06-06 Shehbaz Tariq , Brian Estadimas Arfeto , Chaoning Zhang , Hyundong Shin

MobileSAMv2: Faster Segment Anything to Everything

Segment anything model (SAM) addresses two practical yet challenging segmentation tasks: \textbf{segment anything (SegAny)}, which utilizes a certain point to predict the mask for a single object of interest, and \textbf{segment everything…

Computer Vision and Pattern Recognition · Computer Science 2023-12-18 Chaoning Zhang , Dongshen Han , Sheng Zheng , Jinwoo Choi , Tae-Ho Kim , Choong Seon Hong

Repurposing SAM for User-Defined Semantics Aware Segmentation

The Segment Anything Model (SAM) excels at generating precise object masks from input prompts but lacks semantic awareness, failing to associate its generated masks with specific object categories. To address this limitation, we propose…

Computer Vision and Pattern Recognition · Computer Science 2025-09-04 Rohit Kundu , Sudipta Paul , Arindam Dutta , Amit K. Roy-Chowdhury

AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model

Segment Anything Model (SAM) has gained significant recognition in the field of semantic segmentation due to its versatile capabilities and impressive performance. Despite its success, SAM faces two primary limitations: (1) it relies…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Yuchen Li , Li Zhang , Youwei Liang , Pengtao Xie

Do Instance Priors Help Weakly Supervised Semantic Segmentation?

Semantic segmentation requires dense pixel-level annotations, which are costly and time-consuming to acquire. To address this, we present SeSAM, a framework that uses a foundational segmentation model, i.e. Segment Anything Model (SAM),…

Computer Vision and Pattern Recognition · Computer Science 2026-04-14 Anurag Das , Anna Kukleva , Xinting Hu , Yuki M. Asano , Bernt Schiele

Effective SAM Combination for Open-Vocabulary Semantic Segmentation

Open-vocabulary semantic segmentation aims to assign pixel-level labels to images across an unlimited range of classes. Traditional methods address this by sequentially connecting a powerful mask proposal generator, such as the Segment…

Computer Vision and Pattern Recognition · Computer Science 2025-04-01 Minhyeok Lee , Suhwan Cho , Jungho Lee , Sunghun Yang , Heeseung Choi , Ig-Jae Kim , Sangyoun Lee

Segment Anything Model (SAM) Enhanced Pseudo Labels for Weakly Supervised Semantic Segmentation

Weakly supervised semantic segmentation (WSSS) aims to bypass the need for laborious pixel-level annotation by using only image-level annotation. Most existing methods rely on Class Activation Maps (CAM) to derive pixel-level pseudo-labels…

Computer Vision and Pattern Recognition · Computer Science 2023-11-07 Tianle Chen , Zheda Mai , Ruiwen Li , Wei-lun Chao

On Efficient Variants of Segment Anything Model: A Survey

The Segment Anything Model (SAM) is a foundational model for image segmentation tasks, known for its strong generalization across diverse applications. However, its impressive performance comes with significant computational and resource…

Computer Vision and Pattern Recognition · Computer Science 2026-04-15 Xiaorui Sun , Jun Liu , Heng Tao Shen , Xiaofeng Zhu , Ping Hu

Learning to Prompt Segment Anything Models

Segment Anything Models (SAMs) like SEEM and SAM have demonstrated great potential in learning to segment anything. The core design of SAMs lies with Promptable Segmentation, which takes a handcrafted prompt as input and returns the…

Computer Vision and Pattern Recognition · Computer Science 2024-01-10 Jiaxing Huang , Kai Jiang , Jingyi Zhang , Han Qiu , Lewei Lu , Shijian Lu , Eric Xing

ConformalSAM: Unlocking the Potential of Foundational Segmentation Models in Semi-Supervised Semantic Segmentation with Conformal Prediction

Pixel-level vision tasks, such as semantic segmentation, require extensive and high-quality annotated data, which is costly to obtain. Semi-supervised semantic segmentation (SSSS) has emerged as a solution to alleviate the labeling burden…

Computer Vision and Pattern Recognition · Computer Science 2025-07-22 Danhui Chen , Ziquan Liu , Chuxi Yang , Dan Wang , Yan Yan , Yi Xu , Xiangyang Ji

Segment Anything in High Quality

The recent Segment Anything Model (SAM) represents a big leap in scaling up segmentation models, allowing for powerful zero-shot capabilities and flexible prompting. Despite being trained with 1.1 billion masks, SAM's mask prediction…

Computer Vision and Pattern Recognition · Computer Science 2023-10-24 Lei Ke , Mingqiao Ye , Martin Danelljan , Yifan Liu , Yu-Wing Tai , Chi-Keung Tang , Fisher Yu

Semantic-SAM: Segment and Recognize Anything at Any Granularity

In this paper, we introduce Semantic-SAM, a universal image segmentation model to enable segment and recognize anything at any desired granularity. Our model offers two key advantages: semantic-awareness and granularity-abundance. To…

Computer Vision and Pattern Recognition · Computer Science 2023-07-11 Feng Li , Hao Zhang , Peize Sun , Xueyan Zou , Shilong Liu , Jianwei Yang , Chunyuan Li , Lei Zhang , Jianfeng Gao

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Segment Anything Model (SAM) has emerged as a powerful tool for numerous vision applications. A key component that drives the impressive performance for zero-shot transfer and high versatility is a super large Transformer model trained on…

Computer Vision and Pattern Recognition · Computer Science 2023-12-05 Yunyang Xiong , Bala Varadarajan , Lemeng Wu , Xiaoyu Xiang , Fanyi Xiao , Chenchen Zhu , Xiaoliang Dai , Dilin Wang , Fei Sun , Forrest Iandola , Raghuraman Krishnamoorthi , Vikas Chandra

FastSmoothSAM: A Fast Smooth Method For Segment Anything Model

Accurately identifying and representing object edges is a challenging task in computer vision and image processing. The Segment Anything Model (SAM) has significantly influenced the field of image segmentation, but suffers from high memory…

Computer Vision and Pattern Recognition · Computer Science 2025-07-22 Jiasheng Xu , Yewang Chen

SEG-SAM: Semantic-Guided SAM for Unified Medical Image Segmentation

Recently, developing unified medical image segmentation models gains increasing attention, especially with the advent of the Segment Anything Model (SAM). SAM has shown promising binary segmentation performance in natural domains, however,…

Computer Vision and Pattern Recognition · Computer Science 2024-12-18 Shuangping Huang , Hao Liang , Qingfeng Wang , Chulong Zhong , Zijian Zhou , Miaojing Shi

Semantic Localization Guiding Segment Anything Model For Reference Remote Sensing Image Segmentation

The Reference Remote Sensing Image Segmentation (RRSIS) task generates segmentation masks for specified objects in images based on textual descriptions, which has attracted widespread attention and research interest. Current RRSIS methods…

Computer Vision and Pattern Recognition · Computer Science 2025-06-13 Shuyang Li , Shuang Wang , Zhuangzhuang Sun , Jing Xiao

CLIP-Guided SAM: Parameter-Efficient Semantic Conditioning for Promptable Segmentation

Promptable foundation models such as the Segment Anything Model (SAM) produce high-quality masks but remain semantically blind, relying on external prompts to specify categories. Existing vision-language approaches address this limitation…

Computer Vision and Pattern Recognition · Computer Science 2026-05-26 Shayan Jalilian , Abdul Bais