Related papers: Object-Centric Multi-View Aggregation

Multi-view Human Pose and Shape Estimation Using Learnable Volumetric Aggregation

Human pose and shape estimation from RGB images is a highly sought after alternative to marker-based motion capture, which is laborious, requires expensive equipment, and constrains capture to laboratory environments. Monocular vision-based…

Computer Vision and Pattern Recognition · Computer Science 2020-11-30 Soyong Shin, Eni Halilaj

Multi-Part Object Representations via Graph Structures and Co-Part Discovery

Discovering object-centric representations from images can significantly enhance the robustness, sample efficiency and generalizability of vision models. Works on images with multi-part objects typically follow an implicit object…

Computer Vision and Pattern Recognition · Computer Science 2025-12-29 Alex Foo, Wynne Hsu, Mong Li Lee

Learning Canonical View Representation for 3D Shape Recognition with Arbitrary Views

In this paper, we focus on recognizing 3D shapes from arbitrary views, i.e., arbitrary numbers and positions of viewpoints. It is a challenging and realistic setting for view-based 3D shape recognition. We propose a canonical view…

Computer Vision and Pattern Recognition · Computer Science 2021-08-18 Xin Wei, Yifei Gong, Fudong Wang, Xing Sun, Jian Sun

MoreFusion: Multi-object Reasoning for 6D Pose Estimation from Volumetric Fusion

Robots and other smart devices need efficient object-based scene representations from their on-board vision systems to reason about contact, physics and occlusion. Recognized precise object models will play an important role alongside…

Computer Vision and Pattern Recognition · Computer Science 2020-04-10 Kentaro Wada, Edgar Sucar, Stephen James, Daniel Lenton, Andrew J. Davison

Geometric Capsule Autoencoders for 3D Point Clouds

We propose a method to learn object representations from 3D point clouds using bundles of geometrically interpretable hidden units, which we call geometric capsules. Each geometric capsule represents a visual entity, such as an object or a…

Machine Learning · Computer Science 2019-12-10 Nitish Srivastava, Hanlin Goh, Ruslan Salakhutdinov

Associative3D: Volumetric Reconstruction from Sparse Views

This paper studies the problem of 3D volumetric reconstruction from two views of a scene with an unknown camera. While seemingly easy for humans, this problem poses many challenges for computers since it requires simultaneously…

Computer Vision and Pattern Recognition · Computer Science 2020-07-28 Shengyi Qian, Linyi Jin, David F. Fouhey

Multi-view object pose estimation from correspondence distributions and epipolar geometry

In many automation tasks involving manipulation of rigid objects, the poses of the objects must be acquired. Vision-based pose estimation using a single RGB or RGB-D sensor is especially popular due to its broad applicability. However,…

Computer Vision and Pattern Recognition · Computer Science 2023-03-24 Rasmus Laurvig Haugaard, Thorbjørn Mosekjær Iversen

Sparse 3D Reconstruction via Object-Centric Ray Sampling

We propose a novel method for 3D object reconstruction from a sparse set of views captured from a 360-degree calibrated camera rig. We represent the object surface through a hybrid model that uses both an MLP-based neural representation and…

Computer Vision and Pattern Recognition · Computer Science 2024-03-29 Llukman Cerkezi, Paolo Favaro

Detailed Geometry and Appearance from Opportunistic Motion

Reconstructing 3D geometry and appearance from a sparse set of fixed cameras is a foundational task with broad applications, yet it remains fundamentally constrained by the limited viewpoints. We show that this bound can be broken by…

Computer Vision and Pattern Recognition · Computer Science 2026-03-30 Ryosuke Hirai, Kohei Yamashita, Antoine Guédon, Ryo Kawahara, Vincent Lepetit, Ko Nishino

Learning Canonical 3D Object Representation for Fine-Grained Recognition

We propose a novel framework for fine-grained object recognition that learns to recover object variation in 3D space from a single image, trained on an image collection without using any ground-truth 3D annotation. We accomplish this by…

Computer Vision and Pattern Recognition · Computer Science 2021-08-11 Sunghun Joung, Seungryong Kim, Minsu Kim, Ig-Jae Kim, Kwanghoon Sohn

ROOTS: Object-Centric Representation and Rendering of 3D Scenes

A crucial ability of human intelligence is to build up models of individual 3D objects from partial scene observations. Recent works achieve object-centric generation but without the ability to infer the representation, or achieve 3D scene…

Machine Learning · Computer Science 2021-07-05 Chang Chen, Fei Deng, Sungjin Ahn

Multiview Aggregation for Learning Category-Specific Shape Reconstruction

We investigate the problem of learning category-specific 3D shape reconstruction from a variable number of RGB views of previously unobserved object instances. Most approaches for multiview shape reconstruction operate on sparse shape…

Computer Vision and Pattern Recognition · Computer Science 2019-12-10 Srinath Sridhar, Davis Rempe, Julien Valentin, Sofien Bouaziz, Leonidas J. Guibas

Symmetry Aware Evaluation of 3D Object Detection and Pose Estimation in Scenes of Many Parts in Bulk

While 3D object detection and pose estimation has been studied for a long time, its evaluation is not yet completely satisfactory. Indeed, existing datasets typically consist in numerous acquisitions of only a few scenes because of the…

Computer Vision and Pattern Recognition · Computer Science 2018-06-22 Romain Brégier, Frédéric Devernay, Laetitia Leyrit, James Crowley

Volumetric Instance-Aware Semantic Mapping and 3D Object Discovery

To autonomously navigate and plan interactions in real-world environments, robots require the ability to robustly perceive and map complex, unstructured surrounding scenes. Besides building an internal representation of the observed scene…

Robotics · Computer Science 2021-05-18 Margarita Grinvald, Fadri Furrer, Tonci Novkovic, Jen Jen Chung, Cesar Cadena, Roland Siegwart, Juan Nieto

CylinderPlane: Nested Cylinder Representation for 3D-aware Image Generation

While the proposal of the Tri-plane representation has advanced the development of the 3D-aware image generative models, problems rooted in its inherent structure, such as multi-face artifacts caused by sharing the same features in…

Computer Vision and Pattern Recognition · Computer Science 2025-07-22 Ru Jia, Xiaozhuang Ma, Jianji Wang, Nanning Zheng

Compositional Scene Modeling with Global Object-Centric Representations

The appearance of the same object may vary in different scene images due to perspectives and occlusions between objects. Humans can easily identify the same object, even if occlusions exist, by completing the occluded parts based on its…

Computer Vision and Pattern Recognition · Computer Science 2022-11-28 Tonglin Chen, Bin Li, Zhimeng Shen, Xiangyang Xue

3D Scene Geometry Estimation from 360° Imagery: A Survey

This paper provides a comprehensive survey on pioneer and state-of-the-art 3D scene geometry estimation methodologies based on single, two, or multiple images captured under the omnidirectional optics. We first revisit the basic concepts of…

Computer Vision and Pattern Recognition · Computer Science 2024-01-18 Thiago Lopes Trugillo da Silveira, Paulo Gamarra Lessa Pinto, Jeffri Erwin Murrugarra Llerena, Claudio Rosito Jung

Counting Stacked Objects

Visual object counting is a fundamental computer vision task underpinning numerous real-world applications, from cell counting in biomedicine to traffic and wildlife monitoring. However, existing methods struggle to handle the challenge of…

Computer Vision and Pattern Recognition · Computer Science 2025-07-31 Corentin Dumery, Noa Etté, Aoxiang Fan, Ren Li, Jingyi Xu, Hieu Le, Pascal Fua

Variational Inference for Scalable 3D Object-centric Learning

We tackle the task of scalable unsupervised object-centric representation learning on 3D scenes. Existing approaches to object-centric representation learning show limitations in generalizing to larger scenes as their learning processes…

Computer Vision and Pattern Recognition · Computer Science 2023-09-26 Tianyu Wang, Kee Siong Ng, Miaomiao Liu

Object Pose Transformer: Unifying Unseen Object Pose Estimation

Learning model-free object pose estimation for unseen instances remains a fundamental challenge in 3D vision. Existing methods typically fall into two disjoint paradigms: category-level approaches predict absolute poses in a canonical space…

Computer Vision and Pattern Recognition · Computer Science 2026-03-25 Weihang Li, Lorenzo Garattoni, Fabien Despinoy, Nassir Navab, Benjamin Busam