Related papers: DiffusionDet: Diffusion Model for Object Detection

3DifFusionDet: Diffusion Model for 3D Object Detection with Robust LiDAR-Camera Fusion

Good 3D object detection performance from LiDAR-Camera sensors demands seamless feature alignment and fusion strategies. We propose the 3DifFusionDet framework in this paper, which structures 3D object detection as a denoising diffusion…

Computer Vision and Pattern Recognition · Computer Science 2023-11-08 Xinhao Xiang, Simon Dräger, Jiawei Zhang

ConsistencyDet: A Few-step Denoising Framework for Object Detection Using the Consistency Model

Object detection, a quintessential task in the realm of perceptual computing, can be tackled using a generative methodology. In the present study, we introduce a novel framework designed to articulate object detection as a denoising…

Computer Vision and Pattern Recognition · Computer Science 2025-04-04 Lifan Jiang, Zhihui Wang, Changmiao Wang, Ming Li, Jiaxu Leng

Diffusion-based 3D Object Detection with Random Boxes

3D object detection is an essential task for achieving autonomous driving. Existing anchor-based detection methods rely on empirical heuristics setting of anchors, which makes the algorithms lack elegance. In recent years, we have witnessed…

Computer Vision and Pattern Recognition · Computer Science 2023-09-06 Xin Zhou, Jinghua Hou, Tingting Yao, Dingkang Liang, Zhe Liu, Zhikang Zou, Xiaoqing Ye, Jianwei Cheng, Xiang Bai

Diffusion Model for Camouflaged Object Detection

Camouflaged object detection is a challenging task that aims to identify objects that are highly similar to their background. Due to the powerful noise-to-image denoising capability of denoising diffusion models, in this paper, we propose a…

Computer Vision and Pattern Recognition · Computer Science 2023-08-08 Zhennan Chen, Rongrong Gao, Tian-Zhu Xiang, Fan Lin

DiffusionTrack: Diffusion Model For Multi-Object Tracking

Multi-object tracking (MOT) is a challenging vision task that aims to detect individual objects within a single frame and associate them across multiple frames. Recent MOT approaches can be categorized into two-stage tracking-by-detection…

Computer Vision and Pattern Recognition · Computer Science 2024-02-22 Run Luo, Zikai Song, Lintao Ma, Jinlin Wei, Wei Yang, Min Yang

DiffRef3D: A Diffusion-based Proposal Refinement Framework for 3D Object Detection

Denoising diffusion models show remarkable performances in generative tasks, and their potential applications in perception tasks are gaining interest. In this paper, we introduce a novel framework named DiffRef3D which adopts the diffusion…

Computer Vision and Pattern Recognition · Computer Science 2023-10-26 Se-Ho Kim, Inyong Koo, Inyoung Lee, Byeongjun Park, Changick Kim

FlowDet: Unifying Object Detection and Generative Transport Flows

We present FlowDet, the first formulation of object detection using modern Conditional Flow Matching techniques. This work follows from DiffusionDet, which originally framed detection as a generative denoising problem in the bounding box…

Computer Vision and Pattern Recognition · Computer Science 2025-12-19 Enis Baty, C. P. Bridges, Simon Hadfield

Enhanced Automotive Object Detection via RGB-D Fusion in a DiffusionDet Framework

Vision-based autonomous driving requires reliable and efficient object detection. This work proposes a DiffusionDet-based framework that exploits data fusion from the monocular camera and depth sensor to provide the RGB and depth (RGB-D)…

Computer Vision and Pattern Recognition · Computer Science 2024-06-06 Eliraz Orfaig, Inna Stainvas, Igal Bilik

GenDet: Painting Colored Bounding Boxes on Images via Diffusion Model for Object Detection

This paper presents GenDet, a novel framework that redefines object detection as an image generation task. In contrast to traditional approaches, GenDet adopts a pioneering approach by leveraging generative modeling: it conditions on the…

Computer Vision and Pattern Recognition · Computer Science 2026-01-13 Chen Min, Chengyang Li, Fanjie Kong, Qi Zhu, Dawei Zhao, Liang Xiao

DiffusionInst: Diffusion Model for Instance Segmentation

Diffusion frameworks have achieved comparable performance with previous state-of-the-art image generation models. Researchers are curious about its variants in discriminative tasks because of its powerful noise-to-image denoising pipeline.…

Computer Vision and Pattern Recognition · Computer Science 2022-12-29 Zhangxuan Gu, Haoxing Chen, Zhuoer Xu, Jun Lan, Changhua Meng, Weiqiang Wang

ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models

Modern diffusion-based image generative models have made significant progress and become promising to enrich training data for the object detection task. However, the generation quality and the controllability for complex scenes containing…

Computer Vision and Pattern Recognition · Computer Science 2024-11-07 Jingyuan Zhu, Shiyu Li, Yuxuan Liu, Ping Huang, Jiulong Shan, Huimin Ma, Jian Yuan

DiffYOLO: Object Detection for Anti-Noise via YOLO and Diffusion Models

Object detection models represented by YOLO series have been widely used and have achieved great results on the high quality datasets, but not all the working conditions are ideal. To settle down the problem of locating targets on low…

Computer Vision and Pattern Recognition · Computer Science 2024-01-04 Yichen Liu, Huajian Zhang, Daqing Gao

DifFUSER: Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation

Diffusion models have recently gained prominence as powerful deep generative models, demonstrating unmatched performance across various domains. However, their potential in multi-sensor fusion remains largely unexplored. In this work, we…

Computer Vision and Pattern Recognition · Computer Science 2024-09-25 Duy-Tho Le, Hengcan Shi, Jianfei Cai, Hamid Rezatofighi

Diffusing More Objects for Semi-Supervised Domain Adaptation with Less Labeling

For object detection, it is possible to view the prediction of bounding boxes as a reverse diffusion process. Using a diffusion model, the random bounding boxes are iteratively refined in a denoising step, conditioned on the image. We…

Computer Vision and Pattern Recognition · Computer Science 2023-12-20 Leander van den Heuvel, Gertjan Burghouts, David W. Zhang, Gwenn Englebienne, Sabina B. van Rooij

Diffusion Domain Teacher: Diffusion Guided Domain Adaptive Object Detector

Object detectors often suffer a decrease in performance due to the large domain gap between the training data (source domain) and real-world data (target domain). Diffusion-based generative models have shown remarkable abilities in…

Computer Vision and Pattern Recognition · Computer Science 2025-06-05 Boyong He, Yuxiang Ji, Zhuoyue Tan, Liaoni Wu

Diff3DETR: Agent-based Diffusion Model for Semi-supervised 3D Object Detection

3D object detection is essential for understanding 3D scenes. Contemporary techniques often require extensive annotated training data, yet obtaining point-wise annotations for point clouds is time-consuming and laborious. Recent…

Computer Vision and Pattern Recognition · Computer Science 2024-08-02 Jiacheng Deng, Jiahao Lu, Tianzhu Zhang

Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection

Semi-supervised object detection is crucial for 3D scene understanding, efficiently addressing the limitation of acquiring large-scale 3D bounding box annotations. Existing methods typically employ a teacher-student framework with…

Computer Vision and Pattern Recognition · Computer Science 2023-12-06 Cheng-Ju Ho, Chen-Hsuan Tai, Yen-Yu Lin, Ming-Hsuan Yang, Yi-Hsuan Tsai

6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation

Estimating the 6D object pose from a single RGB image often involves noise and indeterminacy due to challenges such as occlusions and cluttered backgrounds. Meanwhile, diffusion models have shown appealing performance in generating…

Computer Vision and Pattern Recognition · Computer Science 2024-03-25 Li Xu, Haoxuan Qu, Yujun Cai, Jun Liu

CoDiff: Conditional Diffusion Model for Collaborative 3D Object Detection

Collaborative 3D object detection holds significant importance in the field of autonomous driving, as it greatly enhances the perception capabilities of each individual agent by facilitating information exchange among multiple agents.…

Computer Vision and Pattern Recognition · Computer Science 2025-09-05 Zhe Huang, Shuo Wang, Yongcai Wang, Lei Wang

FocusDiffuser: Perceiving Local Disparities for Camouflaged Object Detection

Detecting objects seamlessly blended into their surroundings represents a complex task for both human cognitive capabilities and advanced artificial intelligence algorithms. Currently, the majority of methodologies for detecting camouflaged…

Computer Vision and Pattern Recognition · Computer Science 2024-07-19 Jianwei Zhao, Xin Li, Fan Yang, Qiang Zhai, Ao Luo, Zicheng Jiao, Hong Cheng