Related papers: Learning View Generalization Functions

Investigating the Nature of 3D Generalization in Deep Neural Networks

Visual object recognition systems need to generalize from a set of 2D training views to novel views. The question of how the human visual system can generalize to novel views has been studied and modeled in psychology, computer vision, and…

Computer Vision and Pattern Recognition · Computer Science 2023-04-20 Shoaib Ahmed Siddiqui, David Krueger, Thomas Breuel

Improving Generalization Ability for 3D Object Detection by Learning Sparsity-invariant Features

In autonomous driving, 3D object detection is essential for accurately identifying and tracking objects. Despite the continuous development of various technologies for this task, a significant drawback is observed in most of them: they…

Computer Vision and Pattern Recognition · Computer Science 2025-02-05 Hsin-Cheng Lu, Chung-Yi Lin, Winston H. Hsu

View Generalization for Single Image Textured 3D Models

Humans can easily infer the underlying 3D geometry and texture of an object only from a single 2D image. Current computer vision methods can do this, too, but suffer from view generalization problems - the models inferred tend to make poor…

Computer Vision and Pattern Recognition · Computer Science 2021-06-14 Anand Bhattad, Aysegul Dundar, Guilin Liu, Andrew Tao, Bryan Catanzaro

Geometry-Guided Modeling of Foundation Features Enables Generalizable Object Shape Deformation Learning

Monocular 3D shape recovery is fundamental to geometric understanding, yet achieving robust generalization across arbitrary viewpoints and unseen object categories remains a significant challenge. In this paper, we present a generalizable…

Computer Vision and Pattern Recognition · Computer Science 2026-05-29 Yiyao Ma, Kai Chen, Zhongxiang Zhou, Zhuheng Song, Dongsheng Xie, Zelong Tan, Rong Xiong, Qi Dou

Deep Models for Multi-View 3D Object Recognition: A Review

Human decision-making often relies on visual information from multiple perspectives or views. In contrast, machine learning-based object recognition utilizes information from a single image of the object. However, the information conveyed…

Computer Vision and Pattern Recognition · Computer Science 2025-10-01 Mona Alzahrani, Muhammad Usman, Salma Kammoun, Saeed Anwar, Tarek Helmy

Fostering Generalization in Single-view 3D Reconstruction by Learning a Hierarchy of Local and Global Shape Priors

Single-view 3D object reconstruction has seen much progress, yet methods still struggle to generalize to novel shapes unseen during training. Common approaches predominantly rely on learned global shape priors and, hence, disregard detailed…

Computer Vision and Pattern Recognition · Computer Science 2021-04-02 Jan Bechtold, Maxim Tatarchenko, Volker Fischer, Thomas Brox

View Based Methods can achieve Bayes-Optimal 3D Recognition

This paper proves that visual object recognition systems using only 2D Euclidean similarity measurements to compare object views against previously seen views can achieve the same recognition performance as observers having access to all…

Computer Vision and Pattern Recognition · Computer Science 2007-12-04 Thomas M. Breuel

Multiview Aggregation for Learning Category-Specific Shape Reconstruction

We investigate the problem of learning category-specific 3D shape reconstruction from a variable number of RGB views of previously unobserved object instances. Most approaches for multiview shape reconstruction operate on sparse shape…

Computer Vision and Pattern Recognition · Computer Science 2019-12-10 Srinath Sridhar, Davis Rempe, Julien Valentin, Sofien Bouaziz, Leonidas J. Guibas

Towards Generalizable Multi-Camera 3D Object Detection via Perspective Debiasing

Detecting objects in 3D space using multiple cameras, known as Multi-Camera 3D Object Detection (MC3D-Det), has gained prominence with the advent of bird's-eye view (BEV) approaches. However, these methods often struggle when faced with…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Hao Lu, Yunpeng Zhang, Qing Lian, Dalong Du, Yingcong Chen

Vision-based Robot Manipulation Learning via Human Demonstrations

Vision-based learning methods provide promise for robots to learn complex manipulation tasks. However, how to generalize the learned manipulation skills to real-world interactions remains an open question. In this work, we study robotic…

Robotics · Computer Science 2020-03-03 Zhixin Jia, Mengxiang Lin, Zhixin Chen, Shibo Jian

Anyview: Generalizable Indoor 3D Object Detection with Variable Frames

In this paper, we propose a novel network framework for indoor 3D object detection to handle variable input frame numbers in practical scenarios. Existing methods only consider fixed frames of input data for a single detector, such as…

Computer Vision and Pattern Recognition · Computer Science 2025-07-03 Zhenyu Wu, Xiuwei Xu, Ziwei Wang, Chong Xia, Linqing Zhao, Jiwen Lu, Haibin Yan

Generalizable Human Gaussians for Sparse View Synthesis

Recent progress in neural rendering has brought forth pioneering methods, such as NeRF and Gaussian Splatting, which revolutionize view rendering across various domains like AR/VR, gaming, and content creation. While these methods excel at…

Computer Vision and Pattern Recognition · Computer Science 2024-07-18 Youngjoong Kwon, Baole Fang, Yixing Lu, Haoye Dong, Cheng Zhang, Francisco Vicente Carrasco, Albert Mosella-Montoro, Jianjin Xu, Shingo Takagi, Daeil Kim, Aayush Prakash, Fernando De la Torre

AutoRF: Learning 3D Object Radiance Fields from Single View Observations

We introduce AutoRF - a new approach for learning neural 3D object representations where each object in the training set is observed by only a single view. This setting is in stark contrast to the majority of existing works that leverage…

Computer Vision and Pattern Recognition · Computer Science 2022-04-08 Norman Müller, Andrea Simonelli, Lorenzo Porzi, Samuel Rota Bulò, Matthias Nießner, Peter Kontschieder

Object Learning and Robust 3D Reconstruction

In this thesis we discuss architectural designs and training methods for a neural network to have the ability of dissecting an image into objects of interest without supervision. The main challenge in 2D unsupervised object segmentation is…

Computer Vision and Pattern Recognition · Computer Science 2025-04-28 Sara Sabour

Photo-Geometric Autoencoding to Learn 3D Objects from Unlabelled Images

We show that generative models can be used to capture visual geometry constraints statistically. We use this fact to infer the 3D shape of object categories from raw single-view images. Differently from prior work, we use no external…

Computer Vision and Pattern Recognition · Computer Science 2019-06-05 Shangzhe Wu, Christian Rupprecht, Andrea Vedaldi

Object Detection, Recognition, Deep Learning, and the Universal Law of Generalization

Object detection and recognition are fundamental functions underlying the success of species. Because the appearance of an object exhibits a large variability, the brain has to group these different stimuli under the same object identity, a…

Machine Learning · Computer Science 2022-06-14 Faris B. Rustom, Haluk Öğmen, Arash Yazdanbakhsh

Generalizing Single-View 3D Shape Retrieval to Occlusions and Unseen Objects

Single-view 3D shape retrieval is a challenging task that is increasingly important with the growth of available 3D data. Prior work that has studied this task has not focused on evaluating how realistic occlusions impact performance, and…

Computer Vision and Pattern Recognition · Computer Science 2024-01-02 Qirui Wu, Daniel Ritchie, Manolis Savva, Angel X. Chang

View Inter-Prediction GAN: Unsupervised Representation Learning for 3D Shapes by Learning Global Shape Memories to Support Local View Predictions

In this paper we present a novel unsupervised representation learning approach for 3D shapes, which is an important research challenge as it avoids the manual effort required for collecting supervised data. Our method trains an RNN-based…

Computer Vision and Pattern Recognition · Computer Science 2018-11-08 Zhizhong Han, Mingyang Shang, Yu-Shen Liu, Matthias Zwicker

Generalizable task representation learning from human demonstration videos: a geometric approach

We study the problem of generalizable task learning from human demonstration videos without extra training on the robot or pre-recorded robot motions. Given a set of human demonstration videos showing a task with different objects/tools…

Robotics · Computer Science 2022-03-01 Jun Jin, Martin Jagersand

Self-Supervised Learning of Object Parts for Semantic Segmentation

Progress in self-supervised learning has brought strong general image representation learning methods. Yet so far, it has mostly focused on image-level learning. In turn, tasks such as unsupervised image segmentation have not benefited from…

Computer Vision and Pattern Recognition · Computer Science 2022-06-22 Adrian Ziegler, Yuki M. Asano