Related papers: Learning Pose Specific Representations by Predicti…

Grasp2Vec: Learning Object Representations from Self-Supervised Grasping

Well structured visual representations can make robot learning faster and can improve generalization. In this paper, we study how we can acquire effective object-centric representations for robotic manipulation tasks without human labeling…

Robotics · Computer Science 2018-11-20 Eric Jang, Coline Devin, Vincent Vanhoucke, Sergey Levine

One-Shot Imitation Learning: A Pose Estimation Perspective

In this paper, we study imitation learning under the challenging setting of: (1) only a single demonstration, (2) no further data collection, and (3) no prior task or object knowledge. We show how, with these constraints, imitation learning…

Robotics · Computer Science 2023-10-19 Pietro Vitiello, Kamil Dreczkowski, Edward Johns

Learning Object Localization and 6D Pose Estimation from Simulation and Weakly Labeled Real Images

This work proposes a process for efficiently training a point-wise object detector that enables localizing objects and computing their 6D poses in cluttered and occluded scenes. Accurate pose estimation is typically a requirement for robust…

Computer Vision and Pattern Recognition · Computer Science 2019-02-22 Jean-Philippe Mercier, Chaitanya Mitash, Philippe Giguère, Abdeslam Boularias

Self-Supervised Category-Level Articulated Object Pose Estimation with Part-Level SE(3) Equivariance

Category-level articulated object pose estimation aims to estimate a hierarchy of articulation-aware object poses of an unseen articulated object from a known category. To reduce the heavy annotations needed for supervised learning methods,…

Computer Vision and Pattern Recognition · Computer Science 2023-03-01 Xueyi Liu, Ji Zhang, Ruizhen Hu, Haibin Huang, He Wang, Li Yi

You Only Look at One: Category-Level Object Representations for Pose Estimation From a Single Example

In order to meaningfully interact with the world, robot manipulators must be able to interpret objects they encounter. A critical aspect of this interpretation is pose estimation: inferring quantities that describe the position and…

Robotics · Computer Science 2023-05-23 Walter Goodwin, Ioannis Havoutis, Ingmar Posner

Matching Multiple Perspectives for Efficient Representation Learning

Representation learning approaches typically rely on images of objects captured from a single perspective that are transformed using affine transformations. Additionally, self-supervised learning, a successful paradigm of representation…

Computer Vision and Pattern Recognition · Computer Science 2022-08-17 Omiros Pantazis, Mathew Salvaris

Neural Pose Representation Learning for Generating and Transferring Non-Rigid Object Poses

We propose a novel method for learning representations of poses for 3D deformable objects, which specializes in 1) disentangling pose information from the object's identity, 2) facilitating the learning of pose variations, and 3)…

Computer Vision and Pattern Recognition · Computer Science 2024-11-05 Seungwoo Yoo, Juil Koo, Kyeongmin Yeo, Minhyuk Sung

Unsupervised Robust Disentangling of Latent Characteristics for Image Synthesis

Deep generative models come with the promise to learn an explainable representation for visual objects that allows image sampling, synthesis, and selective modification. The main challenge is to learn to properly model the independent…

Computer Vision and Pattern Recognition · Computer Science 2019-10-24 Patrick Esser, Johannes Haux, Björn Ommer

PAOLI: Pose-free Articulated Object Learning from Sparse-view Images

We present a methodology to model articulated objects using a sparse set of images with unknown poses. Current methods require dense multi-view observations and ground-truth camera poses. Our approach operates with as few as four views per…

Computer Vision and Pattern Recognition · Computer Science 2026-04-06 Jianning Deng, Kartic Subr, Hakan Bilen

Multi-Object Representation Learning with Iterative Variational Inference

Human perception is structured around objects which form the basis for our higher-level cognition and impressive systematic generalization abilities. Yet most work on representation learning focuses on feature learning without even…

Machine Learning · Computer Science 2020-07-29 Klaus Greff, Raphaël Lopez Kaufman, Rishabh Kabra, Nick Watters, Chris Burgess, Daniel Zoran, Loic Matthey, Matthew Botvinick, Alexander Lerchner

Unsupervised Learning of View-invariant Action Representations

Recent successes in human action recognition with deep learning methods mostly adopt the supervised learning paradigm, which requires a significant amount of manually labeled data to achieve good performance. However, label collection is an…

Computer Vision and Pattern Recognition · Computer Science 2018-09-07 Junnan Li, Yongkang Wong, Qi Zhao, Mohan S. Kankanhalli

Object-Based Visual Camera Pose Estimation From Ellipsoidal Model and 3D-Aware Ellipse Prediction

In this paper, we propose a method for initial camera pose estimation from just a single image which is robust to viewing conditions and does not require a detailed model of the scene. This method meets the growing need of easy deployment…

Computer Vision and Pattern Recognition · Computer Science 2022-03-10 Matthieu Zins, Gilles Simon, Marie-Odile Berger

Scene-level Pose Estimation for Multiple Instances of Densely Packed Objects

This paper introduces key machine learning operations that allow the realization of robust, joint 6D pose estimation of multiple instances of objects either densely packed or in unstructured piles from RGB-D data. The first objective is to…

Robotics · Computer Science 2019-10-14 Chaitanya Mitash, Bowen Wen, Kostas Bekris, Abdeslam Boularias

Semi-Supervised 3D Hand-Object Poses Estimation with Interactions in Time

Estimating 3D hand and object pose from a single image is an extremely challenging problem: hands and objects are often self-occluded during interactions, and the 3D annotations are scarce as even humans cannot directly label the…

Computer Vision and Pattern Recognition · Computer Science 2021-06-10 Shaowei Liu, Hanwen Jiang, Jiarui Xu, Sifei Liu, Xiaolong Wang

NeRF-Feat: 6D Object Pose Estimation using Feature Rendering

Object Pose Estimation is a crucial component in robotic grasping and augmented reality. Learning based approaches typically require training data from a highly accurate CAD model or labeled training data acquired using a complex setup. We…

Computer Vision and Pattern Recognition · Computer Science 2024-06-21 Shishir Reddy Vutukur, Heike Brock, Benjamin Busam, Tolga Birdal, Andreas Hutter, Slobodan Ilic

MURAUER: Mapping Unlabeled Real Data for Label AUstERity

Data labeling for learning 3D hand pose estimation models is a huge effort. Readily available, accurately labeled synthetic data has the potential to reduce the effort. However, to successfully exploit synthetic data, current…

Computer Vision and Pattern Recognition · Computer Science 2018-12-06 Georg Poier, Michael Opitz, David Schinagl, Horst Bischof

Learning a Category-level Object Pose Estimator without Pose Annotations

3D object pose estimation is a challenging task. Previous works always require thousands of object images with annotated poses for learning the 3D pose correspondence, which is laborious and time-consuming for labeling. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2024-04-09 Fengrui Tian, Yaoyao Liu, Adam Kortylewski, Yueqi Duan, Shaoyi Du, Alan Yuille, Angtian Wang

A Study on Self-Supervised Object Detection Pretraining

In this work, we study different approaches to self-supervised pretraining of object detection models. We first design a general framework to learn a spatially consistent dense representation from an image, by randomly sampling and…

Computer Vision and Pattern Recognition · Computer Science 2022-08-12 Trung Dang, Simon Kornblith, Huy Thong Nguyen, Peter Chin, Maryam Khademi

Object Pose Estimation through Dexterous Touch

Robust object pose estimation is essential for manipulation and interaction tasks in robotics, particularly in scenarios where visual data is limited or sensitive to lighting, occlusions, and appearances. Tactile sensors often offer limited…

Robotics · Computer Science 2025-09-18 Amir-Hossein Shahidzadeh, Jiyue Zhu, Kezhou Chen, Sha Yi, Cornelia Fermüller, Yiannis Aloimonos, Xiaolong Wang

Unsupervised Geometry-Aware Representation for 3D Human Pose Estimation

Modern 3D human pose estimation techniques rely on deep networks, which require large amounts of training data. While weakly-supervised methods require less supervision, by utilizing 2D poses or multi-view imagery without annotations, they…

Computer Vision and Pattern Recognition · Computer Science 2018-04-05 Helge Rhodin, Mathieu Salzmann, Pascal Fua