Related papers: Shape and Viewpoint without Keypoints

Learning Category-Specific Mesh Reconstruction from Image Collections

We present a learning framework for recovering the 3D shape, camera, and texture of an object from a single image. The shape is represented as a deformable 3D mesh model of an object category where a shape is parameterized by a learned mean…

Computer Vision and Pattern Recognition · Computer Science · 2018-08-01 · Angjoo Kanazawa, Shubham Tulsiani, Alexei A. Efros, Jitendra Malik

Planes vs. Chairs: Category-guided 3D shape learning without any 3D cues

We present a novel 3D shape reconstruction method which learns to predict an implicit 3D shape representation from a single RGB image. Our approach uses a set of single-view images of multiple object categories without viewpoint annotation,…

Computer Vision and Pattern Recognition · Computer Science · 2022-04-22 · Zixuan Huang, Stefan Stojanov, Anh Thai, Varun Jampani, James M. Rehg

Multi-Category Mesh Reconstruction From Image Collections

Recently, learning frameworks have shown the capability of inferring the accurate shape, pose, and texture of an object from a single RGB image. However, current methods are trained on image collections of a single category in order to…

Computer Vision and Pattern Recognition · Computer Science · 2021-10-22 · Alessandro Simoni, Stefano Pini, Roberto Vezzani, Rita Cucchiara

Unsupervised Learning of Object Structure and Dynamics from Videos

Extracting and predicting object structure and dynamics from videos without supervision is a major challenge in machine learning. To address this challenge, we adopt a keypoint-based image representation and learn a stochastic dynamics…

Computer Vision and Pattern Recognition · Computer Science · 2020-03-03 · Matthias Minderer, Chen Sun, Ruben Villegas, Forrester Cole, Kevin Murphy, Honglak Lee

Implicit Mesh Reconstruction from Unannotated Image Collections

We present an approach to infer the 3D shape, texture, and camera pose for an object from a single RGB image, using only category-level image collections with foreground masks as supervision. We represent the shape as an image-conditioned…

Computer Vision and Pattern Recognition · Computer Science · 2020-07-17 · Shubham Tulsiani, Nilesh Kulkarni, Abhinav Gupta

Learning 3D Object Shape and Layout without 3D Supervision

A 3D scene consists of a set of objects, each with a shape and a layout giving their position in space. Understanding 3D scenes from 2D images is an important goal, with applications in robotics and graphics. While there have been recent…

Computer Vision and Pattern Recognition · Computer Science · 2022-06-15 · Georgia Gkioxari, Nikhila Ravi, Justin Johnson

Self-supervised Single-view 3D Reconstruction via Semantic Consistency

We learn a self-supervised, single-view 3D reconstruction model that predicts the 3D mesh shape, texture and camera pose of a target object with a collection of 2D images and silhouettes. The proposed method does not necessitate 3D…

Computer Vision and Pattern Recognition · Computer Science · 2020-03-17 · Xueting Li, Sifei Liu, Kihwan Kim, Shalini De Mello, Varun Jampani, Ming-Hsuan Yang, Jan Kautz

Self-supervised 3D Shape and Viewpoint Estimation from Single Images for Robotics

We present a convolutional neural network for joint 3D shape prediction and viewpoint estimation from a single input image. During training, our network gets the learning signal from a silhouette of an object in the input image - a form of…

Robotics · Computer Science · 2019-10-18 · Oier Mees, Maxim Tatarchenko, Thomas Brox, Wolfram Burgard

Photo-Geometric Autoencoding to Learn 3D Objects from Unlabelled Images

We show that generative models can be used to capture visual geometry constraints statistically. We use this fact to infer the 3D shape of object categories from raw single-view images. Differently from prior work, we use no external…

Computer Vision and Pattern Recognition · Computer Science · 2019-06-05 · Shangzhe Wu, Christian Rupprecht, Andrea Vedaldi

Toward Realistic Single-View 3D Object Reconstruction with Unsupervised Learning from Multiple Images

Recovering the 3D structure of an object from a single image is a challenging task due to its ill-posed nature. One approach is to utilize the plentiful photos of the same object category to learn a strong 3D shape prior for the object.…

Computer Vision and Pattern Recognition · Computer Science · 2021-09-08 · Long-Nhat Ho, Anh Tuan Tran, Quynh Phung, Minh Hoai

ViewNet: Unsupervised Viewpoint Estimation from Conditional Generation

Understanding the 3D world without supervision is currently a major challenge in computer vision as the annotations required to supervise deep networks for tasks in this domain are expensive to obtain on a large scale. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science · 2022-12-02 · Octave Mariotti, Oisin Mac Aodha, Hakan Bilen

Self-supervised learning of object pose estimation using keypoint prediction

This paper describes recent developments in object specific pose and shape prediction from single images. The main contribution is a new approach to camera pose prediction by self-supervised learning of keypoints corresponding to locations…

Computer Vision and Pattern Recognition · Computer Science · 2023-02-21 · Zahra Gharaee, Felix Järemo Lawin, Per-Erik Forssén

Unsupervised 3D Keypoint Discovery with Multi-View Geometry

Analyzing and training 3D body posture models depend heavily on the availability of joint labels that are commonly acquired through laborious manual annotation of body joints or via marker-based joint localization using carefully curated…

Computer Vision and Pattern Recognition · Computer Science · 2024-02-09 · Sina Honari, Chen Zhao, Mathieu Salzmann, Pascal Fua

Unsupervised Learning of Shape and Pose with Differentiable Point Clouds

We address the problem of learning accurate 3D shape and camera pose from a collection of unlabeled category-specific images. We train a convolutional network to predict both the shape and the pose from a single image by minimizing the…

Computer Vision and Pattern Recognition · Computer Science · 2018-10-23 · Eldar Insafutdinov, Alexey Dosovitskiy

Unsupervised Learning of Probably Symmetric Deformable 3D Objects from Images in the Wild

We propose a method to learn 3D deformable object categories from raw single-view images, without external supervision. The method is based on an autoencoder that factors each input image into depth, albedo, viewpoint and illumination. In…

Computer Vision and Pattern Recognition · Computer Science · 2020-04-01 · Shangzhe Wu, Christian Rupprecht, Andrea Vedaldi

Unsupervised Learning of Visual 3D Keypoints for Control

Learning sensorimotor control policies from high-dimensional images crucially relies on the quality of the underlying visual representations. Prior works show that structured latent space such as visual keypoints often outperforms…

Machine Learning · Computer Science · 2021-06-15 · Boyuan Chen, Pieter Abbeel, Deepak Pathak

Learning to Recover 3D Scene Shape from a Single Image

Despite significant progress in monocular depth estimation in the wild, recent state-of-the-art methods cannot be used to recover accurate 3D scene shape due to an unknown depth shift induced by shift-invariant reconstruction losses used in…

Computer Vision and Pattern Recognition · Computer Science · 2020-12-18 · Wei Yin, Jianming Zhang, Oliver Wang, Simon Niklaus, Long Mai, Simon Chen, Chunhua Shen

KeyPointDiffuser: Unsupervised 3D Keypoint Learning via Latent Diffusion Models

Understanding and representing the structure of 3D objects in an unsupervised manner remains a core challenge in computer vision and graphics. Most existing unsupervised keypoint methods are not designed for unconditional generative…

Computer Vision and Pattern Recognition · Computer Science · 2025-12-04 · Rhys Newbury, Juyan Zhang, Tin Tran, Hanna Kurniawati, Dana Kulić

Learning to Generate and Reconstruct 3D Meshes with only 2D Supervision

We present a unified framework tackling two problems: class-specific 3D reconstruction from a single image, and generation of new 3D shape samples. These tasks have received considerable attention recently; however, existing approaches rely…

Computer Vision and Pattern Recognition · Computer Science · 2018-11-16 · Paul Henderson, Vittorio Ferrari

Self-supervised Learning of Detailed 3D Face Reconstruction

In this paper, we present an end-to-end learning framework for detailed 3D face reconstruction from a single image. Our approach uses a 3DMM-based coarse model and a displacement map in UV-space to represent a 3D face. Unlike previous work…

Computer Vision and Pattern Recognition · Computer Science · 2020-09-03 · Yajing Chen, Fanzi Wu, Zeyu Wang, Yibing Song, Yonggen Ling, Linchao Bao