Related papers: Diffusion-based 3D Object Detection with Random Bo…

3DifFusionDet: Diffusion Model for 3D Object Detection with Robust LiDAR-Camera Fusion

Good 3D object detection performance from LiDAR-Camera sensors demands seamless feature alignment and fusion strategies. We propose the 3DifFusionDet framework in this paper, which structures 3D object detection as a denoising diffusion…

Computer Vision and Pattern Recognition · Computer Science 2023-11-08 Xinhao Xiang , Simon Dräger , Jiawei Zhang

DiffusionDet: Diffusion Model for Object Detection

We propose DiffusionDet, a new framework that formulates object detection as a denoising diffusion process from noisy boxes to object boxes. During the training stage, object boxes diffuse from ground-truth boxes to random distribution, and…

Computer Vision and Pattern Recognition · Computer Science 2023-08-22 Shoufa Chen , Peize Sun , Yibing Song , Ping Luo

Enhanced Automotive Object Detection via RGB-D Fusion in a DiffusionDet Framework

Vision-based autonomous driving requires reliable and efficient object detection. This work proposes a DiffusionDet-based framework that exploits data fusion from the monocular camera and depth sensor to provide the RGB and depth (RGB-D)…

Computer Vision and Pattern Recognition · Computer Science 2024-06-06 Eliraz Orfaig , Inna Stainvas , Igal Bilik

GenDet: Painting Colored Bounding Boxes on Images via Diffusion Model for Object Detection

This paper presents GenDet, a novel framework that redefines object detection as an image generation task. In contrast to traditional approaches, GenDet adopts a pioneering approach by leveraging generative modeling: it conditions on the…

Computer Vision and Pattern Recognition · Computer Science 2026-01-13 Chen Min , Chengyang Li , Fanjie Kong , Qi Zhu , Dawei Zhao , Liang Xiao

DiffuBox: Refining 3D Object Detection with Point Diffusion

Ensuring robust 3D object detection and localization is crucial for many applications in robotics and autonomous driving. Recent models, however, face difficulties in maintaining high performance when applied to domains with differing…

Computer Vision and Pattern Recognition · Computer Science 2024-12-10 Xiangyu Chen , Zhenzhen Liu , Katie Z Luo , Siddhartha Datta , Adhitya Polavaram , Yan Wang , Yurong You , Boyi Li , Marco Pavone , Wei-Lun Chao , Mark Campbell , Bharath Hariharan , Kilian Q. Weinberger

DiffRef3D: A Diffusion-based Proposal Refinement Framework for 3D Object Detection

Denoising diffusion models show remarkable performances in generative tasks, and their potential applications in perception tasks are gaining interest. In this paper, we introduce a novel framework named DiffRef3D which adopts the diffusion…

Computer Vision and Pattern Recognition · Computer Science 2023-10-26 Se-Ho Kim , Inyong Koo , Inyoung Lee , Byeongjun Park , Changick Kim

Diff3DETR:Agent-based Diffusion Model for Semi-supervised 3D Object Detection

3D object detection is essential for understanding 3D scenes. Contemporary techniques often require extensive annotated training data, yet obtaining point-wise annotations for point clouds is time-consuming and laborious. Recent…

Computer Vision and Pattern Recognition · Computer Science 2024-08-02 Jiacheng Deng , Jiahao Lu , Tianzhu Zhang

CatFree3D: Category-agnostic 3D Object Detection with Diffusion

Image-based 3D object detection is widely employed in applications such as autonomous vehicles and robotics, yet current systems struggle with generalisation due to complex problem setup and limited training data. We introduce a novel…

Computer Vision and Pattern Recognition · Computer Science 2024-08-26 Wenjing Bian , Zirui Wang , Andrea Vedaldi

3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features

We present 3DiffTection, a state-of-the-art method for 3D object detection from single images, leveraging features from a 3D-aware diffusion model. Annotating large-scale image data for 3D detection is resource-intensive and time-consuming.…

Computer Vision and Pattern Recognition · Computer Science 2023-11-09 Chenfeng Xu , Huan Ling , Sanja Fidler , Or Litany

Diffusion-Based Generative Models for 3D Occupancy Prediction in Autonomous Driving

Accurately predicting 3D occupancy grids from visual inputs is critical for autonomous driving, but current discriminative methods struggle with noisy data, incomplete observations, and the complex structures inherent in 3D scenes. In this…

Computer Vision and Pattern Recognition · Computer Science 2025-07-04 Yunshen Wang , Yicheng Liu , Tianyuan Yuan , Yingshi Liang , Xiuyu Yang , Honggang Zhang , Hang Zhao

DC3DO: Diffusion Classifier for 3D Objects

Inspired by Geoffrey Hinton emphasis on generative modeling, To recognize shapes, first learn to generate them, we explore the use of 3D diffusion models for object classification. Leveraging the density estimates from these models, our…

Computer Vision and Pattern Recognition · Computer Science 2024-09-27 Nursena Koprucu , Meher Shashwat Nigam , Shicheng Xu , Biruk Abere , Gabriele Dominici , Andrew Rodriguez , Sharvaree Vadgama , Berfin Inal , Alberto Tono

Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection

Semi-supervised object detection is crucial for 3D scene understanding, efficiently addressing the limitation of acquiring large-scale 3D bounding box annotations. Existing methods typically employ a teacher-student framework with…

Computer Vision and Pattern Recognition · Computer Science 2023-12-06 Cheng-Ju Ho , Chen-Hsuan Tai , Yen-Yu Lin , Ming-Hsuan Yang , Yi-Hsuan Tsai

Real-time 3D object proposal generation and classification under limited processing resources

The task of detecting 3D objects is important to various robotic applications. The existing deep learning-based detection techniques have achieved impressive performance. However, these techniques are limited to run with a graphics…

Computer Vision and Pattern Recognition · Computer Science 2020-08-14 Xuesong Li , Jose Guivant , Subhan Khan

FlowDet: Unifying Object Detection and Generative Transport Flows

We present FlowDet, the first formulation of object detection using modern Conditional Flow Matching techniques. This work follows from DiffusionDet, which originally framed detection as a generative denoising problem in the bounding box…

Computer Vision and Pattern Recognition · Computer Science 2025-12-19 Enis Baty , C. P. Bridges , Simon Hadfield

DiffusionTrack: Diffusion Model For Multi-Object Tracking

Multi-object tracking (MOT) is a challenging vision task that aims to detect individual objects within a single frame and associate them across multiple frames. Recent MOT approaches can be categorized into two-stage tracking-by-detection…

Computer Vision and Pattern Recognition · Computer Science 2024-02-22 Run Luo , Zikai Song , Lintao Ma , Jinlin Wei , Wei Yang , Min Yang

3D Bounding Box Estimation for Autonomous Vehicles by Cascaded Geometric Constraints and Depurated 2D Detections Using 3D Results

3D object detection is one of the most important tasks in 3D vision perceptual system of autonomous vehicles. In this paper, we propose a novel two stage 3D object detection method aimed at get the optimal solution of object location in 3D…

Computer Vision and Pattern Recognition · Computer Science 2019-09-05 Jiaojiao Fang , Lingtao Zhou , Guizhong Liu

3D Object Detection from Point Cloud via Voting Step Diffusion

3D object detection is a fundamental task in scene understanding. Numerous research efforts have been dedicated to better incorporate Hough voting into the 3D object detection pipeline. However, due to the noisy, cluttered, and partial…

Computer Vision and Pattern Recognition · Computer Science 2024-03-22 Haoran Hou , Mingtao Feng , Zijie Wu , Weisheng Dong , Qing Zhu , Yaonan Wang , Ajmal Mian

ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models

Modern diffusion-based image generative models have made significant progress and become promising to enrich training data for the object detection task. However, the generation quality and the controllability for complex scenes containing…

Computer Vision and Pattern Recognition · Computer Science 2024-11-07 Jingyuan Zhu , Shiyu Li , Yuxuan Liu , Ping Huang , Jiulong Shan , Huimin Ma , Jian Yuan

GS3D: An Efficient 3D Object Detection Framework for Autonomous Driving

We present an efficient 3D object detection framework based on a single RGB image in the scenario of autonomous driving. Our efforts are put on extracting the underlying 3D information in a 2D image and determining the accurate 3D bounding…

Computer Vision and Pattern Recognition · Computer Science 2019-03-28 Buyu Li , Wanli Ouyang , Lu Sheng , Xingyu Zeng , Xiaogang Wang

Diffusion Domain Teacher: Diffusion Guided Domain Adaptive Object Detector

Object detectors often suffer a decrease in performance due to the large domain gap between the training data (source domain) and real-world data (target domain). Diffusion-based generative models have shown remarkable abilities in…

Computer Vision and Pattern Recognition · Computer Science 2025-06-05 Boyong He , Yuxiang Ji , Zhuoyue Tan , Liaoni Wu