English
Related papers

Related papers: Segment Anything with Multiple Modalities

200 papers

Semantic segmentation, a key task in computer vision with broad applications in autonomous driving, medical imaging, and robotics, has advanced substantially with deep learning. Nevertheless, current approaches remain vulnerable to…

Computer Vision and Pattern Recognition · Computer Science 2025-09-15 Iacopo Curti , Pierluigi Zama Ramirez , Alioscia Petrelli , Luigi Di Stefano

Segment Anything (SAM) has recently pushed the boundaries of segmentation by demonstrating zero-shot generalization and flexible prompting after training on over one billion masks. Despite this, its mask prediction accuracy often falls…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Zezhong Fan , Xiaohan Li , Topojoy Biswas , Kaushiki Nag , Kannan Achan

The Segment Anything Model (SAM), a foundational model designed for promptable segmentation tasks, demonstrates exceptional generalization capabilities, making it highly promising for natural scene image segmentation. However, SAM's lack of…

Computer Vision and Pattern Recognition · Computer Science 2024-08-19 Linghao Zheng , Xinyang Pu , Feng Xu

The Segment Anything Model (SAM) is a foundational model for image segmentation tasks, known for its strong generalization across diverse applications. However, its impressive performance comes with significant computational and resource…

Computer Vision and Pattern Recognition · Computer Science 2026-04-15 Xiaorui Sun , Jun Liu , Heng Tao Shen , Xiaofeng Zhu , Ping Hu

Multimodal image fusion and semantic segmentation are critical for autonomous driving. Despite advancements, current models often struggle with segmenting densely packed elements due to a lack of comprehensive fusion features for guidance…

Computer Vision and Pattern Recognition · Computer Science 2025-06-25 Daixun Li , Weiying Xie , Mingxiang Cao , Yunke Wang , Yusi Zhang , Leyuan Fang , Yunsong Li , Chang Xu

Although most existing multi-modal salient object detection (SOD) methods demonstrate effectiveness through training models from scratch, the limited multi-modal data hinders these methods from reaching optimality. In this paper, we propose…

Computer Vision and Pattern Recognition · Computer Science 2024-11-13 Kunpeng Wang , Danying Lin , Chenglong Li , Zhengzheng Tu , Bin Luo

Recent advancements in biomedical image analysis have been significantly driven by the Segment Anything Model (SAM). This transformative technology, originally developed for general-purpose computer vision, has found rapid application in…

Computer Vision and Pattern Recognition · Computer Science 2024-01-17 Ho Hin Lee , Yu Gu , Theodore Zhao , Yanbo Xu , Jianwei Yang , Naoto Usuyama , Cliff Wong , Mu Wei , Bennett A. Landman , Yuankai Huo , Alberto Santamaria-Pang , Hoifung Poon

The recent Segment Anything Model (SAM) represents a significant breakthrough in scaling segmentation models, delivering strong performance across various downstream applications in the RGB modality. However, directly applying SAM to…

Computer Vision and Pattern Recognition · Computer Science 2024-12-06 Chenyang Zhu , Bin Xiao , Lin Shi , Shoukun Xu , Xu Zheng

The Segment Anything Model (SAM) has drawn significant attention from researchers who work on medical image segmentation because of its generalizability. However, researchers have found that SAM may have limited performance on medical…

Computer Vision and Pattern Recognition · Computer Science 2024-03-28 Yihao Liu , Jiaming Zhang , Andres Diaz-Pinto , Haowei Li , Alejandro Martin-Gomez , Amir Kheradmand , Mehran Armand

The Segment Anything Model (SAM) is a foundation model for general image segmentation. Although it exhibits impressive performance predominantly on natural images, understanding its robustness against various image perturbations and domains…

Computer Vision and Pattern Recognition · Computer Science 2023-05-25 Yuqing Wang , Yun Zhao , Linda Petzold

The Segment Anything Model (SAM) is a deep neural network foundational model designed to perform instance segmentation which has gained significant popularity given its zero-shot segmentation ability. SAM operates by generating masks based…

Computer Vision and Pattern Recognition · Computer Science 2024-04-19 Yona Falinie A. Gaus , Neelanjan Bhowmik , Brian K. S. Isaac-Medina , Toby P. Breckon

Due to the inherent flexibility of prompting, foundation models have emerged as the predominant force in the fields of natural language processing and computer vision. The recent introduction of the Segment Anything Model (SAM) signifies a…

Image and Video Processing · Electrical Eng. & Systems 2024-01-09 Yichi Zhang , Zhenrong Shen , Rushi Jiao

The Segment Anything Model (SAM), developed by Meta AI Research, represents a significant breakthrough in computer vision, offering a robust framework for image and video segmentation. This survey provides a comprehensive exploration of the…

The recent Segment Anything Model (SAM) demonstrates strong instance segmentation performance across various downstream tasks. However, SAM is trained solely on RGB data, limiting its direct applicability to RGB-thermal (RGB-T) semantic…

Computer Vision and Pattern Recognition · Computer Science 2025-05-06 Dong Xing , Xianxun Zhu , Wei Zhou , Qika Lin , Hang Yang , Yuqing Wang

The Segment Anything Model (SAM) is the first foundation model for general image segmentation. It has achieved impressive results on various natural image segmentation tasks. However, medical image segmentation (MIS) is more challenging…

Image and Video Processing · Electrical Eng. & Systems 2024-01-18 Yuhao Huang , Xin Yang , Lian Liu , Han Zhou , Ao Chang , Xinrui Zhou , Rusi Chen , Junxuan Yu , Jiongquan Chen , Chaoyu Chen , Sijing Liu , Haozhe Chi , Xindi Hu , Kejuan Yue , Lei Li , Vicente Grau , Deng-Ping Fan , Fajin Dong , Dong Ni

The Segment Anything Model (SAM) has achieved remarkable successes in the realm of natural image segmentation, but its deployment in the medical imaging sphere has encountered challenges. Specifically, the model struggles with medical…

Computer Vision and Pattern Recognition · Computer Science 2024-08-02 Shreyank N Gowda , David A. Clifton

Training segmentation models for medical images continues to be challenging due to the limited availability of data annotations. Segment Anything Model (SAM) is a foundation model that is intended to segment user-defined objects of interest…

Computer Vision and Pattern Recognition · Computer Science 2023-08-09 Maciej A. Mazurowski , Haoyu Dong , Hanxue Gu , Jichen Yang , Nicholas Konz , Yixin Zhang

Segment Anything Model (SAM) has recently shown its powerful effectiveness in visual segmentation tasks. However, there is less exploration concerning how SAM works on audio-visual tasks, such as visual sound localization and segmentation.…

Computer Vision and Pattern Recognition · Computer Science 2023-05-04 Shentong Mo , Yapeng Tian

Salient Object Detection (SOD) aims to identify and segment the most prominent objects in images. Advanced SOD methods often utilize various Convolutional Neural Networks (CNN) or Transformers for deep feature extraction. However, these…

Computer Vision and Pattern Recognition · Computer Science 2024-08-09 Shixuan Gao , Pingping Zhang , Tianyu Yan , Huchuan Lu

Amodal instance segmentation, which aims to detect and segment both visible and invisible parts of objects in images, plays a crucial role in various applications including autonomous driving, robotic manipulation, and scene understanding.…

Computer Vision and Pattern Recognition · Computer Science 2025-03-11 Wei-En Tai , Yu-Lin Shih , Cheng Sun , Yu-Chiang Frank Wang , Hwann-Tzong Chen
‹ Prev 1 2 3 10 Next ›