English
Related papers

Related papers: Cross Resolution Encoding-Decoding For Detection T…

200 papers

Recent DEtection TRansformer-based (DETR) models have obtained remarkable performance. Its success cannot be achieved without the re-introduction of multi-scale feature fusion in the encoder. However, the excessively increased tokens in…

Computer Vision and Pattern Recognition · Computer Science 2023-03-14 Feng Li , Ailing Zeng , Shilong Liu , Hao Zhang , Hongyang Li , Lei Zhang , Lionel M. Ni

DETR has been recently proposed to eliminate the need for many hand-designed components in object detection while demonstrating good performance. However, it suffers from slow convergence and limited feature spatial resolution, due to the…

Computer Vision and Pattern Recognition · Computer Science 2021-03-19 Xizhou Zhu , Weijie Su , Lewei Lu , Bin Li , Xiaogang Wang , Jifeng Dai

End-to-end Object Detection with Transformer (DETR)proposes to perform object detection with Transformer and achieve comparable performance with two-stage object detection like Faster-RCNN. However, DETR needs huge computational resources…

Computer Vision and Pattern Recognition · Computer Science 2021-10-19 Minghang Zheng , Peng Gao , Renrui Zhang , Kunchang Li , Xiaogang Wang , Hongsheng Li , Hao Dong

DETR is the first end-to-end object detector using a transformer encoder-decoder architecture and demonstrates competitive performance but low computational efficiency on high resolution feature maps. The subsequent work, Deformable DETR,…

Computer Vision and Pattern Recognition · Computer Science 2022-03-07 Byungseok Roh , JaeWoong Shin , Wuhyun Shin , Saehoon Kim

Object detection is one of the most significant aspects of computer vision, and it has achieved substantial results in a variety of domains. It is worth noting that there are few studies focusing on slender object detection. CNNs are widely…

Computer Vision and Pattern Recognition · Computer Science 2022-04-25 Wen Feng , Wang Mei , Hu Xiaojie

DEtection TRansformer (DETR) for object detection reaches competitive performance compared with Faster R-CNN via a transformer encoder-decoder architecture. However, trained with scratch transformers, DETR needs large-scale training data…

Computer Vision and Pattern Recognition · Computer Science 2023-07-25 Zhigang Dai , Bolun Cai , Yugeng Lin , Junying Chen

DETR is a recently proposed Transformer-based method which views object detection as a set prediction problem and achieves state-of-the-art performance but demands extra-long training time to converge. In this paper, we investigate the…

Computer Vision and Pattern Recognition · Computer Science 2021-10-13 Zhiqing Sun , Shengcao Cao , Yiming Yang , Kris Kitani

Moving Object Detection (MOD) is a crucial task for the Autonomous Driving pipeline. MOD is usually handled via 2-stream convolutional architectures that incorporates both appearance and motion cues, without considering the inter-relations…

Computer Vision and Pattern Recognition · Computer Science 2021-06-23 Eslam Mohamed , Ahmad El-Sallab

Automated visual inspection of locomotive coil springs presents significant challenges due to the morphological diversity of surface defects, substantial scale variations, and complex industrial backgrounds. This paper proposes MSD-DETR…

Computer Vision and Pattern Recognition · Computer Science 2026-03-17 Matteo Rossi , Pony Matt

Dense object detection is widely used in automatic driving, video surveillance, and other fields. This paper focuses on the challenging task of dense object detection. Currently, detection methods based on greedy algorithms, such as…

Computer Vision and Pattern Recognition · Computer Science 2025-02-12 Yueming Huang , Chenrui Ma , Hao Zhou , Hao Wu , Guowu Yuan

We present a new method that views object detection as a direct set prediction problem. Our approach streamlines the detection pipeline, effectively removing the need for many hand-designed components like a non-maximum suppression…

Computer Vision and Pattern Recognition · Computer Science 2020-05-29 Nicolas Carion , Francisco Massa , Gabriel Synnaeve , Nicolas Usunier , Alexander Kirillov , Sergey Zagoruyko

This paper presents a general scheme for enhancing the convergence and performance of DETR (DEtection TRansformer). We investigate the slow convergence problem in transformers from a new perspective, suggesting that it arises from the…

Computer Vision and Pattern Recognition · Computer Science 2024-07-17 Xiuquan Hou , Meiqin Liu , Senlin Zhang , Ping Wei , Badong Chen , Xuguang Lan

In this paper, we are interested in Detection Transformer (DETR), an end-to-end object detection approach based on a transformer encoder-decoder architecture without hand-crafted postprocessing, such as NMS. Inspired by Conditional DETR, an…

Computer Vision and Pattern Recognition · Computer Science 2022-07-20 Xiaokang Chen , Fangyun Wei , Gang Zeng , Jingdong Wang

Convolutional Neural Networks (CNN) have dominated the field of detection ever since the success of AlexNet in ImageNet classification [12]. With the sweeping reform of Transformers [27] in natural language processing, Carion et al. [2]…

Computer Vision and Pattern Recognition · Computer Science 2022-06-08 Chi Zhang , Lijuan Liu , Xiaoxue Zang , Frederick Liu , Hao Zhang , Xinying Song , Jindong Chen

Transformer-based object detectors (DETR) have shown significant performance across machine vision tasks, ultimately in object detection. This detector is based on a self-attention mechanism along with the transformer encoder-decoder…

Computer Vision and Pattern Recognition · Computer Science 2023-10-16 Zhao Ning Zou , Yuhang Zhang , Robert Wijaya

Object detection with Transformers (DETR) has achieved a competitive performance over traditional detectors, such as Faster R-CNN. However, the potential of DETR remains largely unexplored for the more challenging task of arbitrary-oriented…

Computer Vision and Pattern Recognition · Computer Science 2021-06-08 Teli Ma , Mingyuan Mao , Honghui Zheng , Peng Gao , Xiaodi Wang , Shumin Han , Errui Ding , Baochang Zhang , David Doermann

This paper presents a DETR-based method for cross-domain weakly supervised object detection (CDWSOD), aiming at adapting the detector from source to target domain through weak supervision. We think DETR has strong potential for CDWSOD due…

Computer Vision and Pattern Recognition · Computer Science 2023-04-17 Zongheng Tang , Yifan Sun , Si Liu , Yi Yang

The recently-developed DETR approach applies the transformer encoder and decoder architecture to object detection and achieves promising performance. In this paper, we handle the critical issue, slow training convergence, and present a…

Computer Vision and Pattern Recognition · Computer Science 2023-10-02 Depu Meng , Xiaokang Chen , Zejia Fan , Gang Zeng , Houqiang Li , Yuhui Yuan , Lei Sun , Jingdong Wang

The recently proposed Detection Transformer (DETR) model successfully applies Transformer to objects detection and achieves comparable performance with two-stage object detection frameworks, such as Faster-RCNN. However, DETR suffers from…

Computer Vision and Pattern Recognition · Computer Science 2021-08-21 Peng Gao , Minghang Zheng , Xiaogang Wang , Jifeng Dai , Hongsheng Li

In this paper, we propose a novel query design for the transformer-based object detection. In previous transformer-based detectors, the object queries are a set of learned embeddings. However, each learned embedding does not have an…

Computer Vision and Pattern Recognition · Computer Science 2022-01-05 Yingming Wang , Xiangyu Zhang , Tong Yang , Jian Sun
‹ Prev 1 2 3 10 Next ›