Related papers: Variable-Viewpoint Representations for 3D Object R…

Deep Models for Multi-View 3D Object Recognition: A Review

Human decision-making often relies on visual information from multiple perspectives or views. In contrast, machine learning-based object recognition utilizes information from a single image of the object. However, the information conveyed…

Computer Vision and Pattern Recognition · Computer Science 2025-10-01 Mona Alzahrani , Muhammad Usman , Salma Kammoun , Saeed Anwar , Tarek Helmy

A survey on Deep Learning Advances on Different 3D Data Representations

3D data is a valuable asset the computer vision filed as it provides rich information about the full geometry of sensed objects and scenes. Recently, with the availability of both large 3D datasets and computational power, it is today…

Computer Vision and Pattern Recognition · Computer Science 2019-04-09 Eman Ahmed , Alexandre Saint , Abd El Rahman Shabayek , Kseniya Cherenkova , Rig Das , Gleb Gusev , Djamila Aouada , Bjorn Ottersten

A Survey on Deep Geometry Learning: From a Representation Perspective

Researchers have now achieved great success on dealing with 2D images using deep learning. In recent years, 3D computer vision and Geometry Deep Learning gain more and more attention. Many advanced techniques for 3D shapes have been…

Graphics · Computer Science 2020-04-16 Yun-Peng Xiao , Yu-Kun Lai , Fang-Lue Zhang , Chunpeng Li , Lin Gao

Variational Inference for Scalable 3D Object-centric Learning

We tackle the task of scalable unsupervised object-centric representation learning on 3D scenes. Existing approaches to object-centric representation learning show limitations in generalizing to larger scenes as their learning processes…

Computer Vision and Pattern Recognition · Computer Science 2023-09-26 Tianyu Wang , Kee Siong Ng , Miaomiao Liu

3D Keypoint Estimation Using Implicit Representation Learning

In this paper, we tackle the challenging problem of 3D keypoint estimation of general objects using a novel implicit representation. Previous works have demonstrated promising results for keypoint prediction through direct coordinate…

Computer Vision and Pattern Recognition · Computer Science 2023-06-21 Xiangyu Zhu , Dong Du , Haibin Huang , Chongyang Ma , Xiaoguang Han

Learning 3D object-centric representation through prediction

As part of human core knowledge, the representation of objects is the building block of mental representation that supports high-level concepts and symbolic reasoning. While humans develop the ability of perceiving objects situated in 3D…

Computer Vision and Pattern Recognition · Computer Science 2024-03-07 John Day , Tushar Arora , Jirui Liu , Li Erran Li , Ming Bo Cai

3D Object Recognition By Corresponding and Quantizing Neural 3D Scene Representations

We propose a system that learns to detect objects and infer their 3D poses in RGB-D images. Many existing systems can identify objects and infer 3D poses, but they heavily rely on human labels and 3D annotations. The challenge here is to…

Computer Vision and Pattern Recognition · Computer Science 2020-11-02 Mihir Prabhudesai , Shamit Lal , Hsiao-Yu Fish Tung , Adam W. Harley , Shubhankar Potdar , Katerina Fragkiadaki

Tracking by 3D Model Estimation of Unknown Objects in Videos

Most model-free visual object tracking methods formulate the tracking task as object location estimation given by a 2D segmentation or a bounding box in each video frame. We argue that this representation is limited and instead propose to…

Computer Vision and Pattern Recognition · Computer Science 2023-04-14 Denys Rozumnyi , Jiri Matas , Marc Pollefeys , Vittorio Ferrari , Martin R. Oswald

Multi-Object Representation Learning with Iterative Variational Inference

Human perception is structured around objects which form the basis for our higher-level cognition and impressive systematic generalization abilities. Yet most work on representation learning focuses on feature learning without even…

Machine Learning · Computer Science 2020-07-29 Klaus Greff , Raphaël Lopez Kaufman , Rishabh Kabra , Nick Watters , Chris Burgess , Daniel Zoran , Loic Matthey , Matthew Botvinick , Alexander Lerchner

Open-Ended Fine-Grained 3D Object Categorization by Combining Shape and Texture Features in Multiple Colorspaces

As a consequence of an ever-increasing number of service robots, there is a growing demand for highly accurate real-time 3D object recognition. Considering the expansion of robot applications in more complex and dynamic environments,it is…

Computer Vision and Pattern Recognition · Computer Science 2021-06-01 Nils Keunecke , S. Hamidreza Kasaei

Learning to Reconstruct and Segment 3D Objects

To endow machines with the ability to perceive the real-world in a three dimensional representation as we do as humans is a fundamental and long-standing topic in Artificial Intelligence. Given different types of visual inputs such as…

Computer Vision and Pattern Recognition · Computer Science 2020-10-20 Bo Yang

Review on 6D Object Pose Estimation with the focus on Indoor Scene Understanding

6D object pose estimation problem has been extensively studied in the field of Computer Vision and Robotics. It has wide range of applications such as robot manipulation, augmented reality, and 3D scene understanding. With the advent of…

Computer Vision and Pattern Recognition · Computer Science 2023-04-13 Negar Nejatishahidin , Pooya Fayyazsanavi

Visual Probing and Correction of Object Recognition Models with Interactive user feedback

With the advent of state-of-the-art machine learning and deep learning technologies, several industries are moving towards the field. Applications of such technologies are highly diverse ranging from natural language processing to computer…

Computer Vision and Pattern Recognition · Computer Science 2021-01-01 Viny Saajan Victor , Pramod Vadiraja , Jan-Tobias Sohns , Heike Leitte

Deep Level Sets: Implicit Surface Representations for 3D Shape Inference

Existing 3D surface representation approaches are unable to accurately classify pixels and their orientation lying on the boundary of an object. Thus resulting in coarse representations which usually require post-processing steps to extract…

Computer Vision and Pattern Recognition · Computer Science 2019-01-23 Mateusz Michalkiewicz , Jhony K. Pontes , Dominic Jack , Mahsa Baktashmotlagh , Anders Eriksson

Multiview Representation Learning from Crowdsourced Triplet Comparisons

Crowdsourcing has been used to collect data at scale in numerous fields. Triplet similarity comparison is a type of crowdsourcing task, in which crowd workers are asked the question ``among three given objects, which two are more…

Human-Computer Interaction · Computer Science 2023-02-09 Xiaotian Lu , Jiyi Li , Koh Takeuchi , Hisashi Kashima

Deep Learning for Generic Object Detection: A Survey

Object detection, one of the most fundamental and challenging problems in computer vision, seeks to locate object instances from a large number of predefined categories in natural images. Deep learning techniques have emerged as a powerful…

Computer Vision and Pattern Recognition · Computer Science 2019-08-23 Li Liu , Wanli Ouyang , Xiaogang Wang , Paul Fieguth , Jie Chen , Xinwang Liu , Matti Pietikäinen

Deep Learning Multi-View Representation for Face Recognition

Various factors, such as identities, views (poses), and illuminations, are coupled in face images. Disentangling the identity and view representations is a major challenge in face recognition. Existing face recognition systems either use…

Computer Vision and Pattern Recognition · Computer Science 2014-06-27 Zhenyao Zhu , Ping Luo , Xiaogang Wang , Xiaoou Tang

Deep Appearance Models for Face Rendering

We introduce a deep appearance model for rendering the human face. Inspired by Active Appearance Models, we develop a data-driven rendering pipeline that learns a joint representation of facial geometry and appearance from a multiview…

Graphics · Computer Science 2018-08-02 Stephen Lombardi , Jason Saragih , Tomas Simon , Yaser Sheikh

Simultaneous Multi-View Object Recognition and Grasping in Open-Ended Domains

To aid humans in everyday tasks, robots need to know which objects exist in the scene, where they are, and how to grasp and manipulate them in different situations. Therefore, object recognition and grasping are two key functionalities for…

Robotics · Computer Science 2022-12-07 Hamidreza Kasaei , Sha Luo , Remo Sasso , Mohammadreza Kasaei

Object as Distribution

Object detection is a critical part of visual scene understanding. The representation of the object in the detection task has important implications on the efficiency and feasibility of annotation, robustness to occlusion, pose, lighting,…

Computer Vision and Pattern Recognition · Computer Science 2019-07-31 Li Ding , Lex Fridman