English
Related papers

Related papers: Segment Anything, Even Occluded

200 papers

Amodal segmentation is a challenging task that aims to predict the complete geometric shape of objects, including their occluded regions. Although existing methods primarily focus on amodal segmentation within the training domain, these…

Computer Vision and Pattern Recognition · Computer Science 2026-04-23 Bo Zhang , Zhuotao Tian , Xin Tao , Songlin Tang , Jun Yu , Wenjie Pei

Segment Anything (SAM) has recently pushed the boundaries of segmentation by demonstrating zero-shot generalization and flexible prompting after training on over one billion masks. Despite this, its mask prediction accuracy often falls…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Zezhong Fan , Xiaohan Li , Topojoy Biswas , Kaushiki Nag , Kannan Achan

Handling occlusion remains a significant challenge for video instance-level tasks like Multiple Object Tracking (MOT) and Video Instance Segmentation (VIS). In this paper, we propose a novel framework, Amodal-Aware Video Instance…

Computer Vision and Pattern Recognition · Computer Science 2025-04-11 Minh Tran , Thang Pham , Winston Bounsavy , Tri Nguyen , Ngan Le

Robust and accurate segmentation of scenes has become one core functionality in various visual recognition and navigation tasks. This has inspired the recent development of Segment Anything Model (SAM), a foundation model for general mask…

Computer Vision and Pattern Recognition · Computer Science 2024-08-20 Aoran Xiao , Weihao Xuan , Heli Qi , Yun Xing , Naoto Yokoya , Shijian Lu

Amodal perception requires inferring the full shape of an object that is partially occluded. This task is particularly challenging on two levels: (1) it requires more information than what is contained in the instant retina or imaging…

Computer Vision and Pattern Recognition · Computer Science 2022-10-25 Jian Yao , Yuxin Hong , Chiyu Wang , Tianjun Xiao , Tong He , Francesco Locatello , David Wipf , Yanwei Fu , Zheng Zhang

Semantic amodal segmentation is a recently proposed extension to instance-aware segmentation that includes the prediction of the invisible region of each object instance. We present the first all-in-one end-to-end trainable model for…

Computer Vision and Pattern Recognition · Computer Science 2018-04-25 Patrick Follmann , Rebecca König , Philipp Härtinger , Michael Klostermann

Multimodal image fusion and semantic segmentation are critical for autonomous driving. Despite advancements, current models often struggle with segmenting densely packed elements due to a lack of comprehensive fusion features for guidance…

Computer Vision and Pattern Recognition · Computer Science 2025-06-25 Daixun Li , Weiying Xie , Mingxiang Cao , Yunke Wang , Yusi Zhang , Leyuan Fang , Yunsong Li , Chang Xu

The Segment Anything Model (SAM) is a deep neural network foundational model designed to perform instance segmentation which has gained significant popularity given its zero-shot segmentation ability. SAM operates by generating masks based…

Computer Vision and Pattern Recognition · Computer Science 2024-04-19 Yona Falinie A. Gaus , Neelanjan Bhowmik , Brian K. S. Isaac-Medina , Toby P. Breckon

Segmentation of surgical instruments is crucial for enhancing surgeon performance and ensuring patient safety. Conventional techniques such as binary, semantic, and instance segmentation share a common drawback: they do not accommodate the…

Computer Vision and Pattern Recognition · Computer Science 2024-08-05 Ruohua Shi , Zhaochen Liu , Lingyu Duan , Tingting Jiang

Recently, promptable segmentation models, such as the Segment Anything Model (SAM), have demonstrated robust zero-shot generalization capabilities on static images. These promptable models exhibit denoising abilities for imprecise prompt…

Computer Vision and Pattern Recognition · Computer Science 2024-03-08 Tao Zhou , Wenhan Luo , Qi Ye , Zhiguo Shi , Jiming Chen

Meta AI Research has recently released SAM (Segment Anything Model) which is trained on a large segmentation dataset of over 1 billion masks. As a foundation model in the field of computer vision, SAM (Segment Anything Model) has gained…

Computer Vision and Pattern Recognition · Computer Science 2023-05-02 Dongsheng Han , Chaoning Zhang , Yu Qiao , Maryam Qamar , Yuna Jung , SeungKyu Lee , Sung-Ho Bae , Choong Seon Hong

Segment Anything Model (SAM) has recently shown its powerful effectiveness in visual segmentation tasks. However, there is less exploration concerning how SAM works on audio-visual tasks, such as visual sound localization and segmentation.…

Computer Vision and Pattern Recognition · Computer Science 2023-05-04 Shentong Mo , Yapeng Tian

Although most existing multi-modal salient object detection (SOD) methods demonstrate effectiveness through training models from scratch, the limited multi-modal data hinders these methods from reaching optimality. In this paper, we propose…

Computer Vision and Pattern Recognition · Computer Science 2024-11-13 Kunpeng Wang , Danying Lin , Chenglong Li , Zhengzheng Tu , Bin Luo

The Segment Anything Model (SAM) is the first foundation model for general image segmentation. It has achieved impressive results on various natural image segmentation tasks. However, medical image segmentation (MIS) is more challenging…

Image and Video Processing · Electrical Eng. & Systems 2024-01-18 Yuhao Huang , Xin Yang , Lian Liu , Han Zhou , Ao Chang , Xinrui Zhou , Rusi Chen , Junxuan Yu , Jiongquan Chen , Chaoyu Chen , Sijing Liu , Haozhe Chi , Xindi Hu , Kejuan Yue , Lei Li , Vicente Grau , Deng-Ping Fan , Fajin Dong , Dong Ni

The Segment Anything Model (SAM), a foundational model designed for promptable segmentation tasks, demonstrates exceptional generalization capabilities, making it highly promising for natural scene image segmentation. However, SAM's lack of…

Computer Vision and Pattern Recognition · Computer Science 2024-08-19 Linghao Zheng , Xinyang Pu , Feng Xu

The Segment Anything Model (SAM), introduced by Meta AI Research as a generic object segmentation model, quickly garnered widespread attention and significantly influenced the academic community. To extend its application to video, Meta…

Computer Vision and Pattern Recognition · Computer Science 2024-08-01 Lv Tang , Bo Li

Salient Object Detection (SOD) aims to identify and segment the most prominent objects in images. Advanced SOD methods often utilize various Convolutional Neural Networks (CNN) or Transformers for deep feature extraction. However, these…

Computer Vision and Pattern Recognition · Computer Science 2024-08-09 Shixuan Gao , Pingping Zhang , Tianyu Yan , Huchuan Lu

We introduce the Segment Anything (SA) project: a new task, model, and dataset for image segmentation. Using our efficient model in a data collection loop, we built the largest segmentation dataset to date (by far), with over 1 billion…

Computer Vision and Pattern Recognition · Computer Science 2023-04-06 Alexander Kirillov , Eric Mintun , Nikhila Ravi , Hanzi Mao , Chloe Rolland , Laura Gustafson , Tete Xiao , Spencer Whitehead , Alexander C. Berg , Wan-Yen Lo , Piotr Dollár , Ross Girshick

The robust association of the same objects across video frames in complex scenes is crucial for many applications, especially Multiple Object Tracking (MOT). Current methods predominantly rely on labeled domain-specific video datasets,…

Computer Vision and Pattern Recognition · Computer Science 2024-06-07 Siyuan Li , Lei Ke , Martin Danelljan , Luigi Piccinelli , Mattia Segu , Luc Van Gool , Fisher Yu

Video Object Segmentation (VOS) task aims to segmenting a particular object instance throughout the entire video sequence given only the object mask of the first frame. Recently, Segment Anything Model 2 (SAM 2) is proposed, which is a…

Computer Vision and Pattern Recognition · Computer Science 2024-08-27 Feiyu Pan , Hao Fang , Runmin Cong , Wei Zhang , Xiankai Lu
‹ Prev 1 2 3 10 Next ›