Related papers: Segment Anything with Multiple Modalities

Multimodal SAM-adapter for Semantic Segmentation

Semantic segmentation, a key task in computer vision with broad applications in autonomous driving, medical imaging, and robotics, has advanced substantially with deep learning. Nevertheless, current approaches remain vulnerable to…

Computer Vision and Pattern Recognition · Computer Science 2025-09-15 Iacopo Curti, Pierluigi Zama Ramirez, Alioscia Petrelli, Luigi Di Stefano

Segment and Matte Anything in a Unified Model

Segment Anything (SAM) has recently pushed the boundaries of segmentation by demonstrating zero-shot generalization and flexible prompting after training on over one billion masks. Despite this, its mask prediction accuracy often falls…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Zezhong Fan, Xiaohan Li, Topojoy Biswas, Kaushiki Nag, Kannan Achan

Tuning a SAM-Based Model with Multi-Cognitive Visual Adapter to Remote Sensing Instance Segmentation

The Segment Anything Model (SAM), a foundational model designed for promptable segmentation tasks, demonstrates exceptional generalization capabilities, making it highly promising for natural scene image segmentation. However, SAM's lack of…

Computer Vision and Pattern Recognition · Computer Science 2024-08-19 Linghao Zheng, Xinyang Pu, Feng Xu

On Efficient Variants of Segment Anything Model: A Survey

The Segment Anything Model (SAM) is a foundational model for image segmentation tasks, known for its strong generalization across diverse applications. However, its impressive performance comes with significant computational and resource…

Computer Vision and Pattern Recognition · Computer Science 2026-04-15 Xiaorui Sun, Jun Liu, Heng Tao Shen, Xiaofeng Zhu, Ping Hu

FusionSAM: Visual Multi-Modal Learning with Segment Anything

Multimodal image fusion and semantic segmentation are critical for autonomous driving. Despite advancements, current models often struggle with segmenting densely packed elements due to a lack of comprehensive fusion features for guidance…

Computer Vision and Pattern Recognition · Computer Science 2025-06-25 Daixun Li, Weiying Xie, Mingxiang Cao, Yunke Wang, Yusi Zhang, Leyuan Fang, Yunsong Li, Chang Xu

Adapting Segment Anything Model to Multi-modal Salient Object Detection with Semantic Feature Fusion Guidance

Although most existing multi-modal salient object detection (SOD) methods demonstrate effectiveness through training models from scratch, the limited multi-modal data hinders these methods from reaching optimality. In this paper, we propose…

Computer Vision and Pattern Recognition · Computer Science 2024-11-13 Kunpeng Wang, Danying Lin, Chenglong Li, Zhengzheng Tu, Bin Luo

Foundation Models for Biomedical Image Segmentation: A Survey

Recent advancements in biomedical image analysis have been significantly driven by the Segment Anything Model (SAM). This transformative technology, originally developed for general-purpose computer vision, has found rapid application in…

Computer Vision and Pattern Recognition · Computer Science 2024-01-17 Ho Hin Lee, Yu Gu, Theodore Zhao, Yanbo Xu, Jianwei Yang, Naoto Usuyama, Cliff Wong, Mu Wei, Bennett A. Landman, Yuankai Huo, Alberto Santamaria-Pang, Hoifung Poon

Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts

The recent Segment Anything Model (SAM) represents a significant breakthrough in scaling segmentation models, delivering strong performance across various downstream applications in the RGB modality. However, directly applying SAM to…

Computer Vision and Pattern Recognition · Computer Science 2024-12-06 Chenyang Zhu, Bin Xiao, Lin Shi, Shoukun Xu, Xu Zheng

Segment Any Medical Model Extended

The Segment Anything Model (SAM) has drawn significant attention from researchers who work on medical image segmentation because of its generalizability. However, researchers have found that SAM may have limited performance on medical…

Computer Vision and Pattern Recognition · Computer Science 2024-03-28 Yihao Liu, Jiaming Zhang, Andres Diaz-Pinto, Haowei Li, Alejandro Martin-Gomez, Amir Kheradmand, Mehran Armand

An Empirical Study on the Robustness of the Segment Anything Model (SAM)

The Segment Anything Model (SAM) is a foundation model for general image segmentation. Although it exhibits impressive performance predominantly on natural images, understanding its robustness against various image perturbations and domains…

Computer Vision and Pattern Recognition · Computer Science 2023-05-25 Yuqing Wang, Yun Zhao, Linda Petzold

Performance Evaluation of Segment Anything Model with Variational Prompting for Application to Non-Visible Spectrum Imagery

The Segment Anything Model (SAM) is a deep neural network foundational model designed to perform instance segmentation which has gained significant popularity given its zero-shot segmentation ability. SAM operates by generating masks based…

Computer Vision and Pattern Recognition · Computer Science 2024-04-19 Yona Falinie A. Gaus, Neelanjan Bhowmik, Brian K. S. Isaac-Medina, Toby P. Breckon

Segment Anything Model for Medical Image Segmentation: Current Applications and Future Directions

Due to the inherent flexibility of prompting, foundation models have emerged as the predominant force in the fields of natural language processing and computer vision. The recent introduction of the Segment Anything Model (SAM) signifies a…

Image and Video Processing · Electrical Eng. & Systems 2024-01-09 Yichi Zhang, Zhenrong Shen, Rushi Jiao

A Survey on Segment Anything Model (SAM): Vision Foundation Model Meets Prompt Engineering

The Segment Anything Model (SAM), developed by Meta AI Research, represents a significant breakthrough in computer vision, offering a robust framework for image and video segmentation. This survey provides a comprehensive exploration of the…

Computer Vision and Pattern Recognition · Computer Science 2024-10-22 Chaoning Zhang, Joseph Cho, Fachrina Dewi Puspitasari, Sheng Zheng, Chenghao Li, Yu Qiao, Taegoo Kang, Xinru Shan, Chenshuang Zhang, Caiyan Qin, Francois Rameau, Lik-Hang Lee, Sung-Ho Bae, Choong Seon Hong

Segment Any RGB-Thermal Model with Language-aided Distillation

The recent Segment Anything Model (SAM) demonstrates strong instance segmentation performance across various downstream tasks. However, SAM is trained solely on RGB data, limiting its direct applicability to RGB-thermal (RGB-T) semantic…

Computer Vision and Pattern Recognition · Computer Science 2025-05-06 Dong Xing, Xianxun Zhu, Wei Zhou, Qika Lin, Hang Yang, Yuqing Wang

Segment Anything Model for Medical Images?

The Segment Anything Model (SAM) is the first foundation model for general image segmentation. It has achieved impressive results on various natural image segmentation tasks. However, medical image segmentation (MIS) is more challenging…

Image and Video Processing · Electrical Eng. & Systems 2024-01-18 Yuhao Huang, Xin Yang, Lian Liu, Han Zhou, Ao Chang, Xinrui Zhou, Rusi Chen, Junxuan Yu, Jiongquan Chen, Chaoyu Chen, Sijing Liu, Haozhe Chi, Xindi Hu, Kejuan Yue, Lei Li, Vicente Grau, Deng-Ping Fan, Fajin Dong, Dong Ni

CC-SAM: SAM with Cross-feature Attention and Context for Ultrasound Image Segmentation

The Segment Anything Model (SAM) has achieved remarkable successes in the realm of natural image segmentation, but its deployment in the medical imaging sphere has encountered challenges. Specifically, the model struggles with medical…

Computer Vision and Pattern Recognition · Computer Science 2024-08-02 Shreyank N Gowda, David A. Clifton

Segment Anything Model for Medical Image Analysis: an Experimental Study

Training segmentation models for medical images continues to be challenging due to the limited availability of data annotations. Segment Anything Model (SAM) is a foundation model that is intended to segment user-defined objects of interest…

Computer Vision and Pattern Recognition · Computer Science 2023-08-09 Maciej A. Mazurowski, Haoyu Dong, Hanxue Gu, Jichen Yang, Nicholas Konz, Yixin Zhang

AV-SAM: Segment Anything Model Meets Audio-Visual Localization and Segmentation

Segment Anything Model (SAM) has recently shown its powerful effectiveness in visual segmentation tasks. However, there is less exploration concerning how SAM works on audio-visual tasks, such as visual sound localization and segmentation.…

Computer Vision and Pattern Recognition · Computer Science 2023-05-04 Shentong Mo, Yapeng Tian

Multi-Scale and Detail-Enhanced Segment Anything Model for Salient Object Detection

Salient Object Detection (SOD) aims to identify and segment the most prominent objects in images. Advanced SOD methods often utilize various Convolutional Neural Networks (CNN) or Transformers for deep feature extraction. However, these…

Computer Vision and Pattern Recognition · Computer Science 2024-08-09 Shixuan Gao, Pingping Zhang, Tianyu Yan, Huchuan Lu

Segment Anything, Even Occluded

Amodal instance segmentation, which aims to detect and segment both visible and invisible parts of objects in images, plays a crucial role in various applications including autonomous driving, robotic manipulation, and scene understanding.…

Computer Vision and Pattern Recognition · Computer Science 2025-03-11 Wei-En Tai, Yu-Lin Shih, Cheng Sun, Yu-Chiang Frank Wang, Hwann-Tzong Chen