Related papers: Towards Instance Segmentation with Polygon Detecti…

PolyTransform: Deep Polygon Transformer for Instance Segmentation

In this paper, we propose PolyTransform, a novel instance segmentation algorithm that produces precise, geometry-preserving masks by combining the strengths of prevailing segmentation approaches and modern polygon-based methods. In…

Computer Vision and Pattern Recognition · Computer Science 2021-01-19 Justin Liang , Namdar Homayounfar , Wei-Chiu Ma , Yuwen Xiong , Rui Hu , Raquel Urtasun

Position-Guided Point Cloud Panoptic Segmentation Transformer

DEtection TRansformer (DETR) started a trend that uses a group of learnable queries for unified visual perception. This work begins by applying this appealing paradigm to LiDAR-based point cloud segmentation and obtains a simple yet…

Computer Vision and Pattern Recognition · Computer Science 2023-03-24 Zeqi Xiao , Wenwei Zhang , Tai Wang , Chen Change Loy , Dahua Lin , Jiangmiao Pang

PolarMask++: Enhanced Polar Representation for Single-Shot Instance Segmentation and Beyond

Reducing the complexity of the pipeline of instance segmentation is crucial for real-world applications. This work addresses this issue by introducing an anchor-box free and single-shot instance segmentation framework, termed PolarMask,…

Computer Vision and Pattern Recognition · Computer Science 2021-05-06 Enze Xie , Wenhai Wang , Mingyu Ding , Ruimao Zhang , Ping Luo

Attention-Based Transformers for Instance Segmentation of Cells in Microstructures

Detecting and segmenting object instances is a common task in biomedical applications. Examples range from detecting lesions on functional magnetic resonance images, to the detection of tumours in histopathological images and extracting…

Computer Vision and Pattern Recognition · Computer Science 2021-04-12 Tim Prangemeier , Christoph Reich , Heinz Koeppl

Mask Frozen-DETR: High Quality Instance Segmentation with One GPU

In this paper, we aim to study how to build a strong instance segmenter with minimal training time and GPUs, as opposed to the majority of current approaches that pursue more accurate instance segmenter by building more advanced frameworks…

Computer Vision and Pattern Recognition · Computer Science 2023-08-08 Zhanhao Liang , Yuhui Yuan

LP-DETR: Layer-wise Progressive Relations for Object Detection

This paper presents LP-DETR (Layer-wise Progressive DETR), a novel approach that enhances DETR-based object detection through multi-scale relation modeling. Our method introduces learnable spatial relationships between object queries…

Computer Vision and Pattern Recognition · Computer Science 2025-05-14 Zhengjian Kang , Ye Zhang , Xiaoyu Deng , Xintao Li , Yongzhe Zhang

Real-time instance segmentation with polygons using an Intersection-over-Union loss

Predicting a binary mask for an object is more accurate but also more computationally expensive than a bounding box. Polygonal masks as developed in CenterPoly can be a good compromise. In this paper, we improve over CenterPoly by enhancing…

Computer Vision and Pattern Recognition · Computer Science 2023-05-10 Katia Jodogne-Del Litto , Guillaume-Alexandre Bilodeau

PolarMask: Single Shot Instance Segmentation with Polar Representation

In this paper, we introduce an anchor-box free and single shot instance segmentation method, which is conceptually simple, fully convolutional and can be used as a mask prediction module for instance segmentation, by easily embedding it…

Computer Vision and Pattern Recognition · Computer Science 2020-02-27 Enze Xie , Peize Sun , Xiaoge Song , Wenhai Wang , Ding Liang , Chunhua Shen , Ping Luo

Conditional DETR V2: Efficient Detection Transformer with Box Queries

In this paper, we are interested in Detection Transformer (DETR), an end-to-end object detection approach based on a transformer encoder-decoder architecture without hand-crafted postprocessing, such as NMS. Inspired by Conditional DETR, an…

Computer Vision and Pattern Recognition · Computer Science 2022-07-20 Xiaokang Chen , Fangyun Wei , Gang Zeng , Jingdong Wang

Box2Poly: Memory-Efficient Polygon Prediction of Arbitrarily Shaped and Rotated Text

Recently, Transformer-based text detection techniques have sought to predict polygons by encoding the coordinates of individual boundary vertices using distinct query features. However, this approach incurs a significant memory overhead and…

Computer Vision and Pattern Recognition · Computer Science 2025-08-13 Xuyang Chen , Dong Wang , Konrad Schindler , Mingwei Sun , Yongliang Wang , Nicolo Savioli , Liqiu Meng

PolyFormer: Referring Image Segmentation as Sequential Polygon Generation

In this work, instead of directly predicting the pixel-level segmentation masks, the problem of referring image segmentation is formulated as sequential polygon generation, and the predicted polygons can be later converted into segmentation…

Computer Vision and Pattern Recognition · Computer Science 2023-03-29 Jiang Liu , Hui Ding , Zhaowei Cai , Yuting Zhang , Ravi Kumar Satzoda , Vijay Mahadevan , R. Manmatha

DPDETR: Decoupled Position Detection Transformer for Infrared-Visible Object Detection

Infrared-visible object detection aims to achieve robust object detection by leveraging the complementary information of infrared and visible image pairs. However, the commonly existing modality misalignment problem presents two challenges:…

Computer Vision and Pattern Recognition · Computer Science 2025-10-02 Junjie Guo , Chenqiang Gao , Fangcen Liu , Deyu Meng

Contrastive Learning for Multi-Object Tracking with Transformers

The DEtection TRansformer (DETR) opened new possibilities for object detection by modeling it as a translation task: converting image features into object-level representations. Previous works typically add expensive modules to DETR to…

Computer Vision and Pattern Recognition · Computer Science 2025-05-16 Pierre-François De Plaen , Nicola Marinello , Marc Proesmans , Tinne Tuytelaars , Luc Van Gool

MDS-DETR: DETR with Masked Duplicate Suppressor

The DEtection TRansformer (DETR) is a powerful end-to-end object detector, yet its one-to-one matching strategy suffers from slow convergence and low recall. A common approach to address this issue is to use one-to-many label assignment to…

Computer Vision and Pattern Recognition · Computer Science 2026-05-25 Chanho Lee , Seunghee Koh , Yunho Jeon , Junmo Kim

DETR-based Layered Clothing Segmentation and Fine-Grained Attribute Recognition

Clothing segmentation and fine-grained attribute recognition are challenging tasks at the crossing of computer vision and fashion, which segment the entire ensemble clothing instances as well as recognize detailed attributes of the clothing…

Computer Vision and Pattern Recognition · Computer Science 2023-04-18 Hao Tian , Yu Cao , P. Y. Mok

SOTR: Segmenting Objects with Transformers

Most recent transformer-based models show impressive performance on vision tasks, even better than Convolution Neural Networks (CNN). In this work, we present a novel, flexible, and effective transformer-based model for high-quality…

Computer Vision and Pattern Recognition · Computer Science 2021-08-18 Ruohao Guo , Dantong Niu , Liao Qu , Zhenbo Li

Less is More: Focus Attention for Efficient DETR

DETR-like models have significantly boosted the performance of detectors and even outperformed classical convolutional models. However, all tokens are treated equally without discrimination brings a redundant computational burden in the…

Computer Vision and Pattern Recognition · Computer Science 2023-07-25 Dehua Zheng , Wenhui Dong , Hailin Hu , Xinghao Chen , Yunhe Wang

ISTR: End-to-End Instance Segmentation with Transformers

End-to-end paradigms significantly improve the accuracy of various deep-learning-based computer vision models. To this end, tasks like object detection have been upgraded by replacing non-end-to-end components, such as removing non-maximum…

Computer Vision and Pattern Recognition · Computer Science 2021-05-07 Jie Hu , Liujuan Cao , Yao Lu , ShengChuan Zhang , Yan Wang , Ke Li , Feiyue Huang , Ling Shao , Rongrong Ji

PaQ-DETR: Learning Pattern and Quality-Aware Dynamic Queries for Object Detection

Detection Transformer (DETR) has redefined object detection by casting it as a set prediction task within an end-to-end framework. Despite its elegance, DETR and its variants still rely on fixed learnable queries and suffer from severe…

Computer Vision and Pattern Recognition · Computer Science 2026-03-24 Zhengjian Kang , Jun Zhuang , Kangtong Mo , Qi Chen , Rui Liu , Ye Zhang

detrex: Benchmarking Detection Transformers

The DEtection TRansformer (DETR) algorithm has received considerable attention in the research community and is gradually emerging as a mainstream approach for object detection and other perception tasks. However, the current field lacks a…

Computer Vision and Pattern Recognition · Computer Science 2023-06-14 Tianhe Ren , Shilong Liu , Feng Li , Hao Zhang , Ailing Zeng , Jie Yang , Xingyu Liao , Ding Jia , Hongyang Li , He Cao , Jianan Wang , Zhaoyang Zeng , Xianbiao Qi , Yuhui Yuan , Jianwei Yang , Lei Zhang