Related papers: DiffuBox: Refining 3D Object Detection with Point …

3DifFusionDet: Diffusion Model for 3D Object Detection with Robust LiDAR-Camera Fusion

Good 3D object detection performance from LiDAR-Camera sensors demands seamless feature alignment and fusion strategies. We propose the 3DifFusionDet framework in this paper, which structures 3D object detection as a denoising diffusion…

Computer Vision and Pattern Recognition · Computer Science 2023-11-08 Xinhao Xiang, Simon Dräger, Jiawei Zhang

Diffusion-based 3D Object Detection with Random Boxes

3D object detection is an essential task for achieving autonomous driving. Existing anchor-based detection methods rely on empirical heuristics setting of anchors, which makes the algorithms lack elegance. In recent years, we have witnessed…

Computer Vision and Pattern Recognition · Computer Science 2023-09-06 Xin Zhou, Jinghua Hou, Tingting Yao, Dingkang Liang, Zhe Liu, Zhikang Zou, Xiaoqing Ye, Jianwei Cheng, Xiang Bai

DiffRef3D: A Diffusion-based Proposal Refinement Framework for 3D Object Detection

Denoising diffusion models show remarkable performances in generative tasks, and their potential applications in perception tasks are gaining interest. In this paper, we introduce a novel framework named DiffRef3D which adopts the diffusion…

Computer Vision and Pattern Recognition · Computer Science 2023-10-26 Se-Ho Kim, Inyong Koo, Inyoung Lee, Byeongjun Park, Changick Kim

3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features

We present 3DiffTection, a state-of-the-art method for 3D object detection from single images, leveraging features from a 3D-aware diffusion model. Annotating large-scale image data for 3D detection is resource-intensive and time-consuming.…

Computer Vision and Pattern Recognition · Computer Science 2023-11-09 Chenfeng Xu, Huan Ling, Sanja Fidler, Or Litany

DiffusionDet: Diffusion Model for Object Detection

We propose DiffusionDet, a new framework that formulates object detection as a denoising diffusion process from noisy boxes to object boxes. During the training stage, object boxes diffuse from ground-truth boxes to random distribution, and…

Computer Vision and Pattern Recognition · Computer Science 2023-08-22 Shoufa Chen, Peize Sun, Yibing Song, Ping Luo

6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation

Estimating the 6D object pose from a single RGB image often involves noise and indeterminacy due to challenges such as occlusions and cluttered backgrounds. Meanwhile, diffusion models have shown appealing performance in generating…

Computer Vision and Pattern Recognition · Computer Science 2024-03-25 Li Xu, Haoxuan Qu, Yujun Cai, Jun Liu

Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection

Semi-supervised object detection is crucial for 3D scene understanding, efficiently addressing the limitation of acquiring large-scale 3D bounding box annotations. Existing methods typically employ a teacher-student framework with…

Computer Vision and Pattern Recognition · Computer Science 2023-12-06 Cheng-Ju Ho, Chen-Hsuan Tai, Yen-Yu Lin, Ming-Hsuan Yang, Yi-Hsuan Tsai

Diffusing More Objects for Semi-Supervised Domain Adaptation with Less Labeling

For object detection, it is possible to view the prediction of bounding boxes as a reverse diffusion process. Using a diffusion model, the random bounding boxes are iteratively refined in a denoising step, conditioned on the image. We…

Computer Vision and Pattern Recognition · Computer Science 2023-12-20 Leander van den Heuvel, Gertjan Burghouts, David W. Zhang, Gwenn Englebienne, Sabina B. van Rooij

DifFUSER: Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation

Diffusion models have recently gained prominence as powerful deep generative models, demonstrating unmatched performance across various domains. However, their potential in multi-sensor fusion remains largely unexplored. In this work, we…

Computer Vision and Pattern Recognition · Computer Science 2024-09-25 Duy-Tho Le, Hengcan Shi, Jianfei Cai, Hamid Rezatofighi

Diff9D: Diffusion-Based Domain-Generalized Category-Level 9-DoF Object Pose Estimation

Nine-degrees-of-freedom (9-DoF) object pose and size estimation is crucial for enabling augmented reality and robotic manipulation. Category-level methods have received extensive research attention due to their potential for generalization…

Computer Vision and Pattern Recognition · Computer Science 2025-03-18 Jian Liu, Wei Sun, Hui Yang, Pengchao Deng, Chongpei Liu, Nicu Sebe, Hossein Rahmani, Ajmal Mian

Generalized Diffusion Detector: Mining Robust Features from Diffusion Models for Domain-Generalized Detection

Domain generalization (DG) for object detection aims to enhance detectors' performance in unseen scenarios. This task remains challenging due to complex variations in real-world applications. Recently, diffusion models have demonstrated…

Computer Vision and Pattern Recognition · Computer Science 2025-06-05 Boyong He, Yuxiang Ji, Qianwen Ye, Zhuoyue Tan, Liaoni Wu

Diff3DETR: Agent-based Diffusion Model for Semi-supervised 3D Object Detection

3D object detection is essential for understanding 3D scenes. Contemporary techniques often require extensive annotated training data, yet obtaining point-wise annotations for point clouds is time-consuming and laborious. Recent…

Computer Vision and Pattern Recognition · Computer Science 2024-08-02 Jiacheng Deng, Jiahao Lu, Tianzhu Zhang

Domain Adaptation for Different Sensor Configurations in 3D Object Detection

Recent advances in autonomous driving have underscored the importance of accurate 3D object detection, with LiDAR playing a central role due to its robustness under diverse visibility conditions. However, different vehicle platforms often…

Computer Vision and Pattern Recognition · Computer Science 2025-09-08 Satoshi Tanaka, Kok Seang Tan, Isamu Yamashita

Diffusion Domain Teacher: Diffusion Guided Domain Adaptive Object Detector

Object detectors often suffer a decrease in performance due to the large domain gap between the training data (source domain) and real-world data (target domain). Diffusion-based generative models have shown remarkable abilities in…

Computer Vision and Pattern Recognition · Computer Science 2025-06-05 Boyong He, Yuxiang Ji, Zhuoyue Tan, Liaoni Wu

REXO: Indoor Multi-View Radar Object Detection via 3D Bounding Box Diffusion

Multi-view indoor radar perception has drawn attention due to its cost-effectiveness and low privacy risks. Existing methods often rely on implicit cross-view radar feature association, such as proposal pairing in RFMask or…

Computer Vision and Pattern Recognition · Computer Science 2026-01-13 Ryoma Yataka, Pu Perry Wang, Petros Boufounos, Ryuhei Takahashi

CatFree3D: Category-agnostic 3D Object Detection with Diffusion

Image-based 3D object detection is widely employed in applications such as autonomous vehicles and robotics, yet current systems struggle with generalisation due to complex problem setup and limited training data. We introduce a novel…

Computer Vision and Pattern Recognition · Computer Science 2024-08-26 Wenjing Bian, Zirui Wang, Andrea Vedaldi

DiffuSAM: Diffusion Guided Zero-Shot Object Grounding for Remote Sensing Imagery

Diffusion models have emerged as powerful tools for a wide range of vision tasks, including text-guided image generation and editing. In this work, we explore their potential for object grounding in remote sensing imagery. We propose a…

Computer Vision and Pattern Recognition · Computer Science 2026-04-21 Geet Sethi, Panav Shah, Ashutosh Gandhe, Soumitra Darshan Nayak

Boosting Domain Generalized and Adaptive Detection with Diffusion Models: Fitness, Generalization, and Transferability

Detectors often suffer from performance drop due to domain gap between training and testing data. Recent methods explore diffusion models applied to domain generalization (DG) and adaptation (DA) tasks, but still struggle with large…

Computer Vision and Pattern Recognition · Computer Science 2025-07-01 Boyong He, Yuxiang Ji, Zhuoyue Tan, Liaoni Wu

Boundary Distribution Estimation for Precise Object Detection

In the field of state-of-the-art object detection, the task of object localization is typically accomplished through a dedicated subnet that emphasizes bounding box regression. This subnet traditionally predicts the object's position by…

Computer Vision and Pattern Recognition · Computer Science 2023-07-20 Peng Zhi, Haoran Zhou, Hang Huang, Rui Zhao, Rui Zhou, Qingguo Zhou

ReorientDiff: Diffusion Model based Reorientation for Object Manipulation

The ability to manipulate objects into desired configurations is a fundamental requirement for robots to complete various practical applications. While certain goals can be achieved by picking and placing the objects of interest directly,…

Robotics · Computer Science 2023-09-18 Utkarsh A. Mishra, Yongxin Chen