Related papers: DiffusionDet: Diffusion Model for Object Detection

200 papers

Good 3D object detection performance from LiDAR-Camera sensors demands seamless feature alignment and fusion strategies. We propose the 3DifFusionDet framework in this paper, which structures 3D object detection as a denoising diffusion…

Computer Vision and Pattern Recognition · Computer Science · 2023-11-08 · Xinhao Xiang, Simon Dräger, Jiawei Zhang

Object detection, a quintessential task in the realm of perceptual computing, can be tackled using a generative methodology. In the present study, we introduce a novel framework designed to articulate object detection as a denoising…

Computer Vision and Pattern Recognition · Computer Science · 2025-04-04 · Lifan Jiang, Zhihui Wang, Changmiao Wang, Ming Li, Jiaxu Leng

3D object detection is an essential task for achieving autonomous driving. Existing anchor-based detection methods rely on empirical heuristics setting of anchors, which makes the algorithms lack elegance. In recent years, we have witnessed…

Computer Vision and Pattern Recognition · Computer Science · 2023-09-06 · Xin Zhou, Jinghua Hou, Tingting Yao, Dingkang Liang, Zhe Liu, Zhikang Zou, Xiaoqing Ye, Jianwei Cheng, Xiang Bai

Camouflaged object detection is a challenging task that aims to identify objects that are highly similar to their background. Due to the powerful noise-to-image denoising capability of denoising diffusion models, in this paper, we propose a…

Computer Vision and Pattern Recognition · Computer Science · 2023-08-08 · Zhennan Chen, Rongrong Gao, Tian-Zhu Xiang, Fan Lin

Multi-object tracking (MOT) is a challenging vision task that aims to detect individual objects within a single frame and associate them across multiple frames. Recent MOT approaches can be categorized into two-stage tracking-by-detection…

Computer Vision and Pattern Recognition · Computer Science · 2024-02-22 · Run Luo, Zikai Song, Lintao Ma, Jinlin Wei, Wei Yang, Min Yang

Denoising diffusion models show remarkable performances in generative tasks, and their potential applications in perception tasks are gaining interest. In this paper, we introduce a novel framework named DiffRef3D which adopts the diffusion…

Computer Vision and Pattern Recognition · Computer Science · 2023-10-26 · Se-Ho Kim, Inyong Koo, Inyoung Lee, Byeongjun Park, Changick Kim

We present FlowDet, the first formulation of object detection using modern Conditional Flow Matching techniques. This work follows from DiffusionDet, which originally framed detection as a generative denoising problem in the bounding box…

Computer Vision and Pattern Recognition · Computer Science · 2025-12-19 · Enis Baty, C. P. Bridges, Simon Hadfield

Vision-based autonomous driving requires reliable and efficient object detection. This work proposes a DiffusionDet-based framework that exploits data fusion from the monocular camera and depth sensor to provide the RGB and depth (RGB-D)…

Computer Vision and Pattern Recognition · Computer Science · 2024-06-06 · Eliraz Orfaig, Inna Stainvas, Igal Bilik

This paper presents GenDet, a novel framework that redefines object detection as an image generation task. In contrast to traditional approaches, GenDet adopts a pioneering approach by leveraging generative modeling: it conditions on the…

Computer Vision and Pattern Recognition · Computer Science · 2026-01-13 · Chen Min, Chengyang Li, Fanjie Kong, Qi Zhu, Dawei Zhao, Liang Xiao

Diffusion frameworks have achieved comparable performance with previous state-of-the-art image generation models. Researchers are curious about its variants in discriminative tasks because of its powerful noise-to-image denoising pipeline.…

Computer Vision and Pattern Recognition · Computer Science · 2022-12-29 · Zhangxuan Gu, Haoxing Chen, Zhuoer Xu, Jun Lan, Changhua Meng, Weiqiang Wang

Modern diffusion-based image generative models have made significant progress and become promising to enrich training data for the object detection task. However, the generation quality and the controllability for complex scenes containing…

Computer Vision and Pattern Recognition · Computer Science · 2024-11-07 · Jingyuan Zhu, Shiyu Li, Yuxuan Liu, Ping Huang, Jiulong Shan, Huimin Ma, Jian Yuan

Object detection models represented by YOLO series have been widely used and have achieved great results on the high quality datasets, but not all the working conditions are ideal. To settle down the problem of locating targets on low…

Computer Vision and Pattern Recognition · Computer Science · 2024-01-04 · Yichen Liu, Huajian Zhang, Daqing Gao

Diffusion models have recently gained prominence as powerful deep generative models, demonstrating unmatched performance across various domains. However, their potential in multi-sensor fusion remains largely unexplored. In this work, we…

Computer Vision and Pattern Recognition · Computer Science · 2024-09-25 · Duy-Tho Le, Hengcan Shi, Jianfei Cai, Hamid Rezatofighi

For object detection, it is possible to view the prediction of bounding boxes as a reverse diffusion process. Using a diffusion model, the random bounding boxes are iteratively refined in a denoising step, conditioned on the image. We…

Computer Vision and Pattern Recognition · Computer Science · 2023-12-20 · Leander van den Heuvel, Gertjan Burghouts, David W. Zhang, Gwenn Englebienne, Sabina B. van Rooij

Object detectors often suffer a decrease in performance due to the large domain gap between the training data (source domain) and real-world data (target domain). Diffusion-based generative models have shown remarkable abilities in…

Computer Vision and Pattern Recognition · Computer Science · 2025-06-05 · Boyong He, Yuxiang Ji, Zhuoyue Tan, Liaoni Wu

3D object detection is essential for understanding 3D scenes. Contemporary techniques often require extensive annotated training data, yet obtaining point-wise annotations for point clouds is time-consuming and laborious. Recent…

Computer Vision and Pattern Recognition · Computer Science · 2024-08-02 · Jiacheng Deng, Jiahao Lu, Tianzhu Zhang

Semi-supervised object detection is crucial for 3D scene understanding, efficiently addressing the limitation of acquiring large-scale 3D bounding box annotations. Existing methods typically employ a teacher-student framework with…

Computer Vision and Pattern Recognition · Computer Science · 2023-12-06 · Cheng-Ju Ho, Chen-Hsuan Tai, Yen-Yu Lin, Ming-Hsuan Yang, Yi-Hsuan Tsai

Estimating the 6D object pose from a single RGB image often involves noise and indeterminacy due to challenges such as occlusions and cluttered backgrounds. Meanwhile, diffusion models have shown appealing performance in generating…

Computer Vision and Pattern Recognition · Computer Science · 2024-03-25 · Li Xu, Haoxuan Qu, Yujun Cai, Jun Liu

Collaborative 3D object detection holds significant importance in the field of autonomous driving, as it greatly enhances the perception capabilities of each individual agent by facilitating information exchange among multiple agents.…

Computer Vision and Pattern Recognition · Computer Science · 2025-09-05 · Zhe Huang, Shuo Wang, Yongcai Wang, Lei Wang

Detecting objects seamlessly blended into their surroundings represents a complex task for both human cognitive capabilities and advanced artificial intelligence algorithms. Currently, the majority of methodologies for detecting camouflaged…

Computer Vision and Pattern Recognition · Computer Science · 2024-07-19 · Jianwei Zhao, Xin Li, Fan Yang, Qiang Zhai, Ao Luo, Zicheng Jiao, Hong Cheng