Related papers: Segment Anything, Even Occluded

Amodal SAM: A Unified Amodal Segmentation Framework with Generalization

Amodal segmentation is a challenging task that aims to predict the complete geometric shape of objects, including their occluded regions. Although existing methods primarily focus on amodal segmentation within the training domain, these…

Computer Vision and Pattern Recognition · Computer Science 2026-04-23 Bo Zhang , Zhuotao Tian , Xin Tao , Songlin Tang , Jun Yu , Wenjie Pei

Segment and Matte Anything in a Unified Model

Segment Anything (SAM) has recently pushed the boundaries of segmentation by demonstrating zero-shot generalization and flexible prompting after training on over one billion masks. Despite this, its mask prediction accuracy often falls…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Zezhong Fan , Xiaohan Li , Topojoy Biswas , Kaushiki Nag , Kannan Achan

A2VIS: Amodal-Aware Approach to Video Instance Segmentation

Handling occlusion remains a significant challenge for video instance-level tasks like Multiple Object Tracking (MOT) and Video Instance Segmentation (VIS). In this paper, we propose a novel framework, Amodal-Aware Video Instance…

Computer Vision and Pattern Recognition · Computer Science 2025-04-11 Minh Tran , Thang Pham , Winston Bounsavy , Tri Nguyen , Ngan Le

Segment Anything with Multiple Modalities

Robust and accurate segmentation of scenes has become one core functionality in various visual recognition and navigation tasks. This has inspired the recent development of Segment Anything Model (SAM), a foundation model for general mask…

Computer Vision and Pattern Recognition · Computer Science 2024-08-20 Aoran Xiao , Weihao Xuan , Heli Qi , Yun Xing , Naoto Yokoya , Shijian Lu

Self-supervised Amodal Video Object Segmentation

Amodal perception requires inferring the full shape of an object that is partially occluded. This task is particularly challenging on two levels: (1) it requires more information than what is contained in the instant retina or imaging…

Computer Vision and Pattern Recognition · Computer Science 2022-10-25 Jian Yao , Yuxin Hong , Chiyu Wang , Tianjun Xiao , Tong He , Francesco Locatello , David Wipf , Yanwei Fu , Zheng Zhang

Learning to See the Invisible: End-to-End Trainable Amodal Instance Segmentation

Semantic amodal segmentation is a recently proposed extension to instance-aware segmentation that includes the prediction of the invisible region of each object instance. We present the first all-in-one end-to-end trainable model for…

Computer Vision and Pattern Recognition · Computer Science 2018-04-25 Patrick Follmann , Rebecca König , Philipp Härtinger , Michael Klostermann

FusionSAM: Visual Multi-Modal Learning with Segment Anything

Multimodal image fusion and semantic segmentation are critical for autonomous driving. Despite advancements, current models often struggle with segmenting densely packed elements due to a lack of comprehensive fusion features for guidance…

Computer Vision and Pattern Recognition · Computer Science 2025-06-25 Daixun Li , Weiying Xie , Mingxiang Cao , Yunke Wang , Yusi Zhang , Leyuan Fang , Yunsong Li , Chang Xu

Performance Evaluation of Segment Anything Model with Variational Prompting for Application to Non-Visible Spectrum Imagery

The Segment Anything Model (SAM) is a deep neural network foundational model designed to perform instance segmentation which has gained significant popularity given its zero-shot segmentation ability. SAM operates by generating masks based…

Computer Vision and Pattern Recognition · Computer Science 2024-04-19 Yona Falinie A. Gaus , Neelanjan Bhowmik , Brian K. S. Isaac-Medina , Toby P. Breckon

Amodal Segmentation for Laparoscopic Surgery Video Instruments

Segmentation of surgical instruments is crucial for enhancing surgeon performance and ensuring patient safety. Conventional techniques such as binary, semantic, and instance segmentation share a common drawback: they do not accommodate the…

Computer Vision and Pattern Recognition · Computer Science 2024-08-05 Ruohua Shi , Zhaochen Liu , Lingyu Duan , Tingting Jiang

SAM-PD: How Far Can SAM Take Us in Tracking and Segmenting Anything in Videos by Prompt Denoising

Recently, promptable segmentation models, such as the Segment Anything Model (SAM), have demonstrated robust zero-shot generalization capabilities on static images. These promptable models exhibit denoising abilities for imprecise prompt…

Computer Vision and Pattern Recognition · Computer Science 2024-03-08 Tao Zhou , Wenhan Luo , Qi Ye , Zhiguo Shi , Jiming Chen

Segment Anything Model (SAM) Meets Glass: Mirror and Transparent Objects Cannot Be Easily Detected

Meta AI Research has recently released SAM (Segment Anything Model) which is trained on a large segmentation dataset of over 1 billion masks. As a foundation model in the field of computer vision, SAM (Segment Anything Model) has gained…

Computer Vision and Pattern Recognition · Computer Science 2023-05-02 Dongsheng Han , Chaoning Zhang , Yu Qiao , Maryam Qamar , Yuna Jung , SeungKyu Lee , Sung-Ho Bae , Choong Seon Hong

AV-SAM: Segment Anything Model Meets Audio-Visual Localization and Segmentation

Segment Anything Model (SAM) has recently shown its powerful effectiveness in visual segmentation tasks. However, there is less exploration concerning how SAM works on audio-visual tasks, such as visual sound localization and segmentation.…

Computer Vision and Pattern Recognition · Computer Science 2023-05-04 Shentong Mo , Yapeng Tian

Adapting Segment Anything Model to Multi-modal Salient Object Detection with Semantic Feature Fusion Guidance

Although most existing multi-modal salient object detection (SOD) methods demonstrate effectiveness through training models from scratch, the limited multi-modal data hinders these methods from reaching optimality. In this paper, we propose…

Computer Vision and Pattern Recognition · Computer Science 2024-11-13 Kunpeng Wang , Danying Lin , Chenglong Li , Zhengzheng Tu , Bin Luo

Segment Anything Model for Medical Images?

The Segment Anything Model (SAM) is the first foundation model for general image segmentation. It has achieved impressive results on various natural image segmentation tasks. However, medical image segmentation (MIS) is more challenging…

Image and Video Processing · Electrical Eng. & Systems 2024-01-18 Yuhao Huang , Xin Yang , Lian Liu , Han Zhou , Ao Chang , Xinrui Zhou , Rusi Chen , Junxuan Yu , Jiongquan Chen , Chaoyu Chen , Sijing Liu , Haozhe Chi , Xindi Hu , Kejuan Yue , Lei Li , Vicente Grau , Deng-Ping Fan , Fajin Dong , Dong Ni

Tuning a SAM-Based Model with Multi-Cognitive Visual Adapter to Remote Sensing Instance Segmentation

The Segment Anything Model (SAM), a foundational model designed for promptable segmentation tasks, demonstrates exceptional generalization capabilities, making it highly promising for natural scene image segmentation. However, SAM's lack of…

Computer Vision and Pattern Recognition · Computer Science 2024-08-19 Linghao Zheng , Xinyang Pu , Feng Xu

Evaluating SAM2's Role in Camouflaged Object Detection: From SAM to SAM2

The Segment Anything Model (SAM), introduced by Meta AI Research as a generic object segmentation model, quickly garnered widespread attention and significantly influenced the academic community. To extend its application to video, Meta…

Computer Vision and Pattern Recognition · Computer Science 2024-08-01 Lv Tang , Bo Li

Multi-Scale and Detail-Enhanced Segment Anything Model for Salient Object Detection

Salient Object Detection (SOD) aims to identify and segment the most prominent objects in images. Advanced SOD methods often utilize various Convolutional Neural Networks (CNN) or Transformers for deep feature extraction. However, these…

Computer Vision and Pattern Recognition · Computer Science 2024-08-09 Shixuan Gao , Pingping Zhang , Tianyu Yan , Huchuan Lu

Segment Anything

We introduce the Segment Anything (SA) project: a new task, model, and dataset for image segmentation. Using our efficient model in a data collection loop, we built the largest segmentation dataset to date (by far), with over 1 billion…

Computer Vision and Pattern Recognition · Computer Science 2023-04-06 Alexander Kirillov , Eric Mintun , Nikhila Ravi , Hanzi Mao , Chloe Rolland , Laura Gustafson , Tete Xiao , Spencer Whitehead , Alexander C. Berg , Wan-Yen Lo , Piotr Dollár , Ross Girshick

Matching Anything by Segmenting Anything

The robust association of the same objects across video frames in complex scenes is crucial for many applications, especially Multiple Object Tracking (MOT). Current methods predominantly rely on labeled domain-specific video datasets,…

Computer Vision and Pattern Recognition · Computer Science 2024-06-07 Siyuan Li , Lei Ke , Martin Danelljan , Luigi Piccinelli , Mattia Segu , Luc Van Gool , Fisher Yu

Video Object Segmentation via SAM 2: The 4th Solution for LSVOS Challenge VOS Track

Video Object Segmentation (VOS) task aims to segmenting a particular object instance throughout the entire video sequence given only the object mask of the first frame. Recently, Segment Anything Model 2 (SAM 2) is proposed, which is a…

Computer Vision and Pattern Recognition · Computer Science 2024-08-27 Feiyu Pan , Hao Fang , Runmin Cong , Wei Zhang , Xiankai Lu