Related papers: DC3DO: Diffusion Classifier for 3D Objects

200 papers

The recent wave of large-scale text-to-image diffusion models has dramatically increased our text-based image generation abilities. These models can generate realistic images for a staggering variety of prompts and exhibit impressive…

Machine Learning · Computer Science · 2023-09-14 · Alexander C. Li, Mihir Prabhudesai, Shivam Duggal, Ellis Brown, Deepak Pathak

3D object detection is an essential task for achieving autonomous driving. Existing anchor-based detection methods rely on empirical heuristics setting of anchors, which makes the algorithms lack elegance. In recent years, we have witnessed…

Computer Vision and Pattern Recognition · Computer Science · 2023-09-06 · Xin Zhou, Jinghua Hou, Tingting Yao, Dingkang Liang, Zhe Liu, Zhikang Zou, Xiaoqing Ye, Jianwei Cheng, Xiang Bai

In this paper, we learn a diffusion model to generate 3D data on a scene-scale. Specifically, our model crafts a 3D scene consisting of multiple objects, while recent diffusion research has focused on a single object. To realize our goal,…

Computer Vision and Pattern Recognition · Computer Science · 2023-01-03 · Jumin Lee, Woobin Im, Sebin Lee, Sung-Eui Yoon

Understanding visual scenes is fundamental to human intelligence. While discriminative models have significantly advanced computer vision, they often struggle with compositional understanding. In contrast, recent generative text-to-image…

Computer Vision and Pattern Recognition · Computer Science · 2025-11-04 · Yujin Jeong, Arnas Uselis, Seong Joon Oh, Anna Rohrbach

Detecting objects seamlessly blended into their surroundings represents a complex task for both human cognitive capabilities and advanced artificial intelligence algorithms. Currently, the majority of methodologies for detecting camouflaged…

Computer Vision and Pattern Recognition · Computer Science · 2024-07-19 · Jianwei Zhao, Xin Li, Fan Yang, Qiang Zhai, Ao Luo, Zicheng Jiao, Hong Cheng

Diffusion models have recently gained prominence as powerful deep generative models, demonstrating unmatched performance across various domains. However, their potential in multi-sensor fusion remains largely unexplored. In this work, we…

Computer Vision and Pattern Recognition · Computer Science · 2024-09-25 · Duy-Tho Le, Hengcan Shi, Jianfei Cai, Hamid Rezatofighi

Probabilistic denoising diffusion models (DDMs) have set a new standard for 2D image generation. Extending DDMs for 3D content creation is an active field of research. Here, we propose TetraDiffusion, a diffusion model that operates on a…

Computer Vision and Pattern Recognition · Computer Science · 2024-08-12 · Nikolai Kalischek, Torben Peters, Jan D. Wegner, Konrad Schindler

Diffusion models have emerged as a powerful generative method, capable of producing stunning photo-realistic images from natural language descriptions. However, these models lack explicit control over the 3D structure in the generated…

Computer Vision and Pattern Recognition · Computer Science · 2024-04-05 · Wufei Ma, Qihao Liu, Jiahao Wang, Angtian Wang, Xiaoding Yuan, Yi Zhang, Zihao Xiao, Guofeng Zhang, Beijia Lu, Ruxiao Duan, Yongrui Qi, Adam Kortylewski, Yaoyao Liu, Alan Yuille

Discriminative classifiers have become a foundational tool in deep learning for medical imaging, excelling at learning separable features of complex data distributions. However, these models often need careful design, augmentation, and…

Computer Vision and Pattern Recognition · Computer Science · 2025-08-11 · Gian Mario Favero, Parham Saremi, Emily Kaczmarek, Brennan Nichyporuk, Tal Arbel

In this paper, we tackle a new task of 3D object synthesis, where a 3D model is composited with another object category to create a novel 3D model. However, most existing text/image/3D-to-3D methods struggle to effectively integrate…

Computer Vision and Pattern Recognition · Computer Science · 2025-09-03 · Zeren Xiong, Zikun Chen, Zedong Zhang, Xiang Li, Ying Tai, Jian Yang, Jun Li

Object pose estimation from a single view remains a challenging problem. In particular, partial observability, occlusions, and object symmetries eventually result in pose ambiguity. To account for this multimodality, this work proposes…

Computer Vision and Pattern Recognition · Computer Science · 2024-12-03 · Christian Möller, Niklas Funk, Jan Peters

3D object detection often involves complicated training and testing pipelines, which require substantial domain knowledge about individual datasets. Inspired by recent non-maximum suppression-free 2D object detection models, we propose a 3D…

Computer Vision and Pattern Recognition · Computer Science · 2021-10-14 · Yue Wang, Justin Solomon

Controllable generation of 3D assets is important for many practical applications like content creation in movies, games and engineering, as well as in AR/VR. Recently, diffusion models have shown remarkable results in generation quality of…

Computer Vision and Pattern Recognition · Computer Science · 2024-08-01 · Philipp Schröppel, Christopher Wewer, Jan Eric Lenssen, Eddy Ilg, Thomas Brox

Object detectors often suffer a decrease in performance due to the large domain gap between the training data (source domain) and real-world data (target domain). Diffusion-based generative models have shown remarkable abilities in…

Computer Vision and Pattern Recognition · Computer Science · 2025-06-05 · Boyong He, Yuxiang Ji, Zhuoyue Tan, Liaoni Wu

We present 3DiffTection, a state-of-the-art method for 3D object detection from single images, leveraging features from a 3D-aware diffusion model. Annotating large-scale image data for 3D detection is resource-intensive and time-consuming.…

Computer Vision and Pattern Recognition · Computer Science · 2023-11-09 · Chenfeng Xu, Huan Ling, Sanja Fidler, Or Litany

Diffusion models have emerged as the best approach for generative modeling of 2D images. Part of their success is due to the possibility of training them on millions if not billions of images with a stable learning objective. However,…

Computer Vision and Pattern Recognition · Computer Science · 2023-05-23 · Animesh Karnewar, Andrea Vedaldi, David Novotny, Niloy Mitra

Accurately predicting 3D occupancy grids from visual inputs is critical for autonomous driving, but current discriminative methods struggle with noisy data, incomplete observations, and the complex structures inherent in 3D scenes. In this…

Computer Vision and Pattern Recognition · Computer Science · 2025-07-04 · Yunshen Wang, Yicheng Liu, Tianyuan Yuan, Yingshi Liang, Xiuyu Yang, Honggang Zhang, Hang Zhao

We propose DiffusionDet, a new framework that formulates object detection as a denoising diffusion process from noisy boxes to object boxes. During the training stage, object boxes diffuse from ground-truth boxes to random distribution, and…

Computer Vision and Pattern Recognition · Computer Science · 2023-08-22 · Shoufa Chen, Peize Sun, Yibing Song, Ping Luo

Object-centric learning aims to represent visual data with a set of object entities (a.k.a. slots), providing structured representations that enable systematic generalization. Leveraging advanced architectures like Transformers, recent…

Computer Vision and Pattern Recognition · Computer Science · 2023-09-25 · Ziyi Wu, Jingyu Hu, Wuyue Lu, Igor Gilitschenski, Animesh Garg

In recent years, 3D vision has become a crucial field within computer vision, powering a wide range of applications such as autonomous driving, robotics, augmented reality, and medical imaging. This field relies on accurate perception,…

Computer Vision and Pattern Recognition · Computer Science · 2025-04-02 · Zhen Wang, Dongyuan Li, Yaozu Wu, Tianyu He, Jiang Bian, Renhe Jiang