Related papers: SOTR: Segmenting Objects with Transformers

SOIT: Segmenting Objects with Instance-Aware Transformers

This paper presents an end-to-end instance segmentation framework, termed SOIT, that Segments Objects with Instance-aware Transformers. Inspired by DETR (Carion et al., 2020), our method views instance segmentation as a direct set prediction…

Computer Vision and Pattern Recognition · Computer Science · 2021-12-24 · Xiaodong Yu, Dahu Shi, Xing Wei, Ye Ren, Tingqun Ye, Wenming Tan

Unknown Object Segmentation from Stereo Images

Although instance-aware perception is a key prerequisite for many autonomous robotic applications, most of the methods only partially solve the problem by focusing solely on known object categories. However, for robots interacting in…

Computer Vision and Pattern Recognition · Computer Science · 2021-03-12 · Maximilian Durner, Wout Boerdijk, Martin Sundermeyer, Werner Friedl, Zoltan-Csaba Marton, Rudolph Triebel

End-to-End Object Detection with Transformers

We present a new method that views object detection as a direct set prediction problem. Our approach streamlines the detection pipeline, effectively removing the need for many hand-designed components like a non-maximum suppression…

Computer Vision and Pattern Recognition · Computer Science · 2020-05-29 · Nicolas Carion, Francisco Massa, Gabriel Synnaeve, Nicolas Usunier, Alexander Kirillov, Sergey Zagoruyko

Semi-Supervised Hierarchical Semantic Object Parsing

Models based on Convolutional Neural Networks (CNNs) have been proven very successful for semantic segmentation and object parsing that yield hierarchies of features. Our key insight is to build convolutional networks that take input of…

Artificial Intelligence · Computer Science · 2017-10-31 · Jalal Mirakhorli, Hamidreza Amindavar

Transformer Assisted Convolutional Network for Cell Instance Segmentation

Region proposal based methods like R-CNN and Faster R-CNN models have proven to be extremely successful in object detection and segmentation tasks. Recently, Transformers have also gained popularity in the domain of Computer Vision, and are…

Computer Vision and Pattern Recognition · Computer Science · 2021-10-07 · Deepanshu Pandey, Pradyumna Gupta, Sumit Bhattacharya, Aman Sinha, Rohit Agarwal

ContourFormer: Real-Time Contour-Based End-to-End Instance Segmentation Transformer

This paper presents Contourformer, a real-time contour-based instance segmentation algorithm. The method is fully based on the DETR paradigm and achieves end-to-end inference through iterative and progressive mechanisms to optimize…

Computer Vision and Pattern Recognition · Computer Science · 2025-04-16 · Weiwei Yao, Chen Li, Minjun Xiong, Wenbo Dong, Hao Chen, Xiong Xiao

Transformers in Small Object Detection: A Benchmark and Survey of State-of-the-Art

Transformers have rapidly gained popularity in computer vision, especially in the field of object recognition and detection. Upon examining the outcomes of state-of-the-art object detection methods, we noticed that transformers consistently…

Computer Vision and Pattern Recognition · Computer Science · 2023-09-12 · Aref Miri Rekavandi, Shima Rashidi, Farid Boussaid, Stephen Hoefs, Emre Akbas, Mohammed Bennamoun

High-Performance Transformers for Table Structure Recognition Need Early Convolutions

Table structure recognition (TSR) aims to convert tabular images into a machine-readable format, where a visual encoder extracts image features and a textual decoder generates table-representing tokens. Existing approaches use classic…

Computer Vision and Pattern Recognition · Computer Science · 2023-11-10 · ShengYun Peng, Seongmin Lee, Xiaojing Wang, Rajarajeswari Balasubramaniyan, Duen Horng Chau

Image Segmentation with transformers: An Overview, Challenges and Future

Image segmentation, a key task in computer vision, has traditionally relied on convolutional neural networks (CNNs), yet these models struggle with capturing complex spatial dependencies, objects with varying scales, need for manually…

Computer Vision and Pattern Recognition · Computer Science · 2025-01-17 · Deepjyoti Chetia, Debasish Dutta, Sanjib Kr Kalita

ISTR: End-to-End Instance Segmentation with Transformers

End-to-end paradigms significantly improve the accuracy of various deep-learning-based computer vision models. To this end, tasks like object detection have been upgraded by replacing non-end-to-end components, such as removing non-maximum…

Computer Vision and Pattern Recognition · Computer Science · 2021-05-07 · Jie Hu, Liujuan Cao, Yao Lu, ShengChuan Zhang, Yan Wang, Ke Li, Feiyue Huang, Ling Shao, Rongrong Ji

Bottom-up Instance Segmentation using Deep Higher-Order CRFs

Traditional Scene Understanding problems such as Object Detection and Semantic Segmentation have made breakthroughs in recent years due to the adoption of deep learning. However, the former task is not able to localise objects at a pixel…

Computer Vision and Pattern Recognition · Computer Science · 2016-09-12 · Anurag Arnab, Philip H. S. Torr

CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation

Convolutional neural networks (CNNs) have been the de facto standard for nowadays 3D medical image segmentation. The convolutional operations used in these networks, however, inevitably have limitations in modeling the long-range dependency…

Computer Vision and Pattern Recognition · Computer Science · 2021-03-05 · Yutong Xie, Jianpeng Zhang, Chunhua Shen, Yong Xia

FocalUNETR: A Focal Transformer for Boundary-aware Segmentation of CT Images

Computed Tomography (CT) based precise prostate segmentation for treatment planning is challenging due to (1) the unclear boundary of the prostate derived from CT's poor soft tissue contrast and (2) the limitation of convolutional neural…

Image and Video Processing · Electrical Eng. & Systems · 2023-07-20 · Chengyin Li, Yao Qiang, Rafi Ibn Sultan, Hassan Bagher-Ebadian, Prashant Khanduri, Indrin J. Chetty, Dongxiao Zhu

DOCTR: Disentangled Object-Centric Transformer for Point Scene Understanding

Point scene understanding is a challenging task to process real-world scene point cloud, which aims at segmenting each object, estimating its pose, and reconstructing its mesh simultaneously. Recent state-of-the-art method first segments…

Computer Vision and Pattern Recognition · Computer Science · 2024-03-26 · Xiaoxuan Yu, Hao Wang, Weiming Li, Qiang Wang, Soonyong Cho, Younghun Sung

Mask R-CNN

We present a conceptually simple, flexible, and general framework for object instance segmentation. Our approach efficiently detects objects in an image while simultaneously generating a high-quality segmentation mask for each instance. The…

Computer Vision and Pattern Recognition · Computer Science · 2018-01-25 · Kaiming He, Georgia Gkioxari, Piotr Dollár, Ross Girshick

Mask Transfiner for High-Quality Instance Segmentation

Two-stage and query-based instance segmentation methods have achieved remarkable results. However, their segmented masks are still very coarse. In this paper, we present Mask Transfiner for high-quality and efficient instance segmentation.…

Computer Vision and Pattern Recognition · Computer Science · 2021-11-29 · Lei Ke, Martin Danelljan, Xia Li, Yu-Wing Tai, Chi-Keung Tang, Fisher Yu

ConvTransSeg: A Multi-resolution Convolution-Transformer Network for Medical Image Segmentation

Convolutional neural networks (CNNs) achieved the state-of-the-art performance in medical image segmentation due to their ability to extract highly complex feature representations. However, it is argued in recent studies that traditional…

Computer Vision and Pattern Recognition · Computer Science · 2025-03-31 · Zhendi Gong, Andrew P. French, Guoping Qiu, Xin Chen

DETR-based Layered Clothing Segmentation and Fine-Grained Attribute Recognition

Clothing segmentation and fine-grained attribute recognition are challenging tasks at the crossing of computer vision and fashion, which segment the entire ensemble clothing instances as well as recognize detailed attributes of the clothing…

Computer Vision and Pattern Recognition · Computer Science · 2023-04-18 · Hao Tian, Yu Cao, P. Y. Mok

Contour Proposal Networks for Biomedical Instance Segmentation

We present a conceptually simple framework for object instance segmentation called Contour Proposal Network (CPN), which detects possibly overlapping objects in an image while simultaneously fitting closed object contours using an…

Computer Vision and Pattern Recognition · Computer Science · 2021-04-09 · Eric Upschulte, Stefan Harmeling, Katrin Amunts, Timo Dickscheid

End-to-End Video Instance Segmentation with Transformers

Video instance segmentation (VIS) is the task that requires simultaneously classifying, segmenting and tracking object instances of interest in video. Recent methods typically develop sophisticated pipelines to tackle this task. Here, we…

Computer Vision and Pattern Recognition · Computer Science · 2021-10-11 · Yuqing Wang, Zhaoliang Xu, Xinlong Wang, Chunhua Shen, Baoshan Cheng, Hao Shen, Huaxia Xia