Related papers: Cross Resolution Encoding-Decoding For Detection T…

Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR

Recent DEtection TRansformer-based (DETR) models have obtained remarkable performance. Its success cannot be achieved without the re-introduction of multi-scale feature fusion in the encoder. However, the excessively increased tokens in…

Computer Vision and Pattern Recognition · Computer Science 2023-03-14 Feng Li , Ailing Zeng , Shilong Liu , Hao Zhang , Hongyang Li , Lei Zhang , Lionel M. Ni

Deformable DETR: Deformable Transformers for End-to-End Object Detection

DETR has been recently proposed to eliminate the need for many hand-designed components in object detection while demonstrating good performance. However, it suffers from slow convergence and limited feature spatial resolution, due to the…

Computer Vision and Pattern Recognition · Computer Science 2021-03-19 Xizhou Zhu , Weijie Su , Lewei Lu , Bin Li , Xiaogang Wang , Jifeng Dai

End-to-End Object Detection with Adaptive Clustering Transformer

End-to-end Object Detection with Transformer (DETR)proposes to perform object detection with Transformer and achieve comparable performance with two-stage object detection like Faster-RCNN. However, DETR needs huge computational resources…

Computer Vision and Pattern Recognition · Computer Science 2021-10-19 Minghang Zheng , Peng Gao , Renrui Zhang , Kunchang Li , Xiaogang Wang , Hongsheng Li , Hao Dong

Sparse DETR: Efficient End-to-End Object Detection with Learnable Sparsity

DETR is the first end-to-end object detector using a transformer encoder-decoder architecture and demonstrates competitive performance but low computational efficiency on high resolution feature maps. The subsequent work, Deformable DETR,…

Computer Vision and Pattern Recognition · Computer Science 2022-03-07 Byungseok Roh , JaeWoong Shin , Wuhyun Shin , Saehoon Kim

DFAM-DETR: Deformable feature based attention mechanism DETR on slender object detection

Object detection is one of the most significant aspects of computer vision, and it has achieved substantial results in a variety of domains. It is worth noting that there are few studies focusing on slender object detection. CNNs are widely…

Computer Vision and Pattern Recognition · Computer Science 2022-04-25 Wen Feng , Wang Mei , Hu Xiaojie

UP-DETR: Unsupervised Pre-training for Object Detection with Transformers

DEtection TRansformer (DETR) for object detection reaches competitive performance compared with Faster R-CNN via a transformer encoder-decoder architecture. However, trained with scratch transformers, DETR needs large-scale training data…

Computer Vision and Pattern Recognition · Computer Science 2023-07-25 Zhigang Dai , Bolun Cai , Yugeng Lin , Junying Chen

Rethinking Transformer-based Set Prediction for Object Detection

DETR is a recently proposed Transformer-based method which views object detection as a set prediction problem and achieves state-of-the-art performance but demands extra-long training time to converge. In this paper, we investigate the…

Computer Vision and Pattern Recognition · Computer Science 2021-10-13 Zhiqing Sun , Shengcao Cao , Yiming Yang , Kris Kitani

MODETR: Moving Object Detection with Transformers

Moving Object Detection (MOD) is a crucial task for the Autonomous Driving pipeline. MOD is usually handled via 2-stream convolutional architectures that incorporates both appearance and motion cues, without considering the inter-relations…

Computer Vision and Pattern Recognition · Computer Science 2021-06-23 Eslam Mohamed , Ahmad El-Sallab

A Deformable Attention-Based Detection Transformer with Cross-Scale Feature Fusion for Industrial Coil Spring Inspection

Automated visual inspection of locomotive coil springs presents significant challenges due to the morphological diversity of surface defects, substantial scale variations, and complex industrial backgrounds. This paper proposes MSD-DETR…

Computer Vision and Pattern Recognition · Computer Science 2026-03-17 Matteo Rossi , Pony Matt

Dense Object Detection Based on De-homogenized Queries

Dense object detection is widely used in automatic driving, video surveillance, and other fields. This paper focuses on the challenging task of dense object detection. Currently, detection methods based on greedy algorithms, such as…

Computer Vision and Pattern Recognition · Computer Science 2025-02-12 Yueming Huang , Chenrui Ma , Hao Zhou , Hao Wu , Guowu Yuan

End-to-End Object Detection with Transformers

We present a new method that views object detection as a direct set prediction problem. Our approach streamlines the detection pipeline, effectively removing the need for many hand-designed components like a non-maximum suppression…

Computer Vision and Pattern Recognition · Computer Science 2020-05-29 Nicolas Carion , Francisco Massa , Gabriel Synnaeve , Nicolas Usunier , Alexander Kirillov , Sergey Zagoruyko

Relation DETR: Exploring Explicit Position Relation Prior for Object Detection

This paper presents a general scheme for enhancing the convergence and performance of DETR (DEtection TRansformer). We investigate the slow convergence problem in transformers from a new perspective, suggesting that it arises from the…

Computer Vision and Pattern Recognition · Computer Science 2024-07-17 Xiuquan Hou , Meiqin Liu , Senlin Zhang , Ping Wei , Badong Chen , Xuguang Lan

Conditional DETR V2: Efficient Detection Transformer with Box Queries

In this paper, we are interested in Detection Transformer (DETR), an end-to-end object detection approach based on a transformer encoder-decoder architecture without hand-crafted postprocessing, such as NMS. Inspired by Conditional DETR, an…

Computer Vision and Pattern Recognition · Computer Science 2022-07-20 Xiaokang Chen , Fangyun Wei , Gang Zeng , Jingdong Wang

DETR++: Taming Your Multi-Scale Detection Transformer

Convolutional Neural Networks (CNN) have dominated the field of detection ever since the success of AlexNet in ImageNet classification [12]. With the sweeping reform of Transformers [27] in natural language processing, Carion et al. [2]…

Computer Vision and Pattern Recognition · Computer Science 2022-06-08 Chi Zhang , Lijuan Liu , Xiaoxue Zang , Frederick Liu , Hao Zhang , Xinying Song , Jindong Chen

Investigating the Robustness and Properties of Detection Transformers (DETR) Toward Difficult Images

Transformer-based object detectors (DETR) have shown significant performance across machine vision tasks, ultimately in object detection. This detector is based on a self-attention mechanism along with the transformer encoder-decoder…

Computer Vision and Pattern Recognition · Computer Science 2023-10-16 Zhao Ning Zou , Yuhang Zhang , Robert Wijaya

Oriented Object Detection with Transformer

Object detection with Transformers (DETR) has achieved a competitive performance over traditional detectors, such as Faster R-CNN. However, the potential of DETR remains largely unexplored for the more challenging task of arbitrary-oriented…

Computer Vision and Pattern Recognition · Computer Science 2021-06-08 Teli Ma , Mingyuan Mao , Honghui Zheng , Peng Gao , Xiaodi Wang , Shumin Han , Errui Ding , Baochang Zhang , David Doermann

DETR with Additional Global Aggregation for Cross-domain Weakly Supervised Object Detection

This paper presents a DETR-based method for cross-domain weakly supervised object detection (CDWSOD), aiming at adapting the detector from source to target domain through weak supervision. We think DETR has strong potential for CDWSOD due…

Computer Vision and Pattern Recognition · Computer Science 2023-04-17 Zongheng Tang , Yifan Sun , Si Liu , Yi Yang

Conditional DETR for Fast Training Convergence

The recently-developed DETR approach applies the transformer encoder and decoder architecture to object detection and achieves promising performance. In this paper, we handle the critical issue, slow training convergence, and present a…

Computer Vision and Pattern Recognition · Computer Science 2023-10-02 Depu Meng , Xiaokang Chen , Zejia Fan , Gang Zeng , Houqiang Li , Yuhui Yuan , Lei Sun , Jingdong Wang

Fast Convergence of DETR with Spatially Modulated Co-Attention

The recently proposed Detection Transformer (DETR) model successfully applies Transformer to objects detection and achieves comparable performance with two-stage object detection frameworks, such as Faster-RCNN. However, DETR suffers from…

Computer Vision and Pattern Recognition · Computer Science 2021-08-21 Peng Gao , Minghang Zheng , Xiaogang Wang , Jifeng Dai , Hongsheng Li

Anchor DETR: Query Design for Transformer-Based Object Detection

In this paper, we propose a novel query design for the transformer-based object detection. In previous transformer-based detectors, the object queries are a set of learned embeddings. However, each learned embedding does not have an…

Computer Vision and Pattern Recognition · Computer Science 2022-01-05 Yingming Wang , Xiangyu Zhang , Tong Yang , Jian Sun