Related papers: SABER-6D: Shape Representation Based Implicit Obje…

Implicit 3D Orientation Learning for 6D Object Detection from RGB Images

We propose a real-time RGB-based pipeline for object detection and 6D pose estimation. Our novel 3D orientation estimation is based on a variant of the Denoising Autoencoder that is trained on simulated views of a 3D model using Domain…

Computer Vision and Pattern Recognition · Computer Science · 2019-07-18 · Martin Sundermeyer, Zoltan-Csaba Marton, Maximilian Durner, Manuel Brucker, Rudolph Triebel

OPFormer: Object Pose Estimation leveraging foundation model with geometric encoding

We introduce a unified, end-to-end framework that seamlessly integrates object detection and pose estimation with a versatile onboarding process. Our pipeline begins with an onboarding stage that generates object representations from either…

Computer Vision and Pattern Recognition · Computer Science · 2025-11-18 · Artem Moroz, Vít Zeman, Martin Mikšík, Elizaveta Isianova, Miroslav David, Pavel Burget, Varun Burde

DISP6D: Disentangled Implicit Shape and Pose Learning for Scalable 6D Pose Estimation

Scalable 6D pose estimation for rigid objects from RGB images aims at handling multiple objects and generalizing to novel objects. Building on a well-known auto-encoding framework to cope with object symmetry and the lack of labeled…

Computer Vision and Pattern Recognition · Computer Science · 2023-03-13 · Yilin Wen, Xiangyu Li, Hao Pan, Lei Yang, Zheng Wang, Taku Komura, Wenping Wang

ShapeEmbed: a self-supervised learning framework for 2D contour quantification

The shape of objects is an important source of visual information in a wide range of applications. One of the core challenges of shape quantification is to ensure that the extracted measurements remain invariant to transformations that…

Computer Vision and Pattern Recognition · Computer Science · 2025-07-02 · Anna Foix Romero, Craig Russell, Alexander Krull, Virginie Uhlmann

Category-Level 6D Object Pose Estimation with Flexible Vector-Based Rotation Representation

In this paper, we propose a novel 3D graph convolution based pipeline for category-level 6D pose and size estimation from monocular RGB-D images. The proposed method leverages an efficient 3D data augmentation and a novel vector-based…

Computer Vision and Pattern Recognition · Computer Science · 2023-01-31 · Wei Chen, Xi Jia, Zhongqun Zhang, Hyung Jin Chang, Linlin Shen, Jinming Duan, Ales Leonardis

Towards Symmetry-sensitive Pose Estimation: A Rotation Representation for Symmetric Object Classes

Symmetric objects are common in daily life and industry, yet their inherent orientation ambiguities that impede the training of deep learning networks for pose estimation are rarely discussed in the literature. To cope with these…

Computer Vision and Pattern Recognition · Computer Science · 2026-04-21 · Andreas Kriegler, Csaba Beleznai, Margrit Gelautz

Neural Pose Representation Learning for Generating and Transferring Non-Rigid Object Poses

We propose a novel method for learning representations of poses for 3D deformable objects, which specializes in 1) disentangling pose information from the object's identity, 2) facilitating the learning of pose variations, and 3)…

Computer Vision and Pattern Recognition · Computer Science · 2024-11-05 · Seungwoo Yoo, Juil Koo, Kyeongmin Yeo, Minhyuk Sung

Learning Spatio-Temporal Transformer for Visual Tracking

In this paper, we present a new tracking architecture with an encoder-decoder transformer as the key component. The encoder models the global spatio-temporal feature dependencies between target objects and search regions, while the decoder…

Computer Vision and Pattern Recognition · Computer Science · 2021-04-01 · Bin Yan, Houwen Peng, Jianlong Fu, Dong Wang, Huchuan Lu

CRISP: Object Pose and Shape Estimation with Test-Time Adaptation

We consider the problem of estimating object pose and shape from an RGB-D image. Our first contribution is to introduce CRISP, a category-agnostic object pose and shape estimation pipeline. The pipeline implements an encoder-decoder model…

Computer Vision and Pattern Recognition · Computer Science · 2024-12-03 · Jingnan Shi, Rajat Talak, Harry Zhang, David Jin, Luca Carlone

NeuSurfEmb: A Complete Pipeline for Dense Correspondence-based 6D Object Pose Estimation without CAD Models

State-of-the-art approaches for 6D object pose estimation assume the availability of CAD models and require the user to manually set up physically-based rendering (PBR) pipelines for synthetic training data generation. Both factors limit…

Computer Vision and Pattern Recognition · Computer Science · 2025-10-15 · Francesco Milano, Jen Jen Chung, Hermann Blum, Roland Siegwart, Lionel Ott

Multi-path Learning for Object Pose Estimation Across Domains

We introduce a scalable approach for object pose estimation trained on simulated RGB views of multiple 3D models together. We learn an encoding of object views that does not only describe an implicit orientation of all objects seen during…

Computer Vision and Pattern Recognition · Computer Science · 2020-04-06 · Martin Sundermeyer, Maximilian Durner, En Yen Puang, Zoltan-Csaba Marton, Narunas Vaskevicius, Kai O. Arras, Rudolph Triebel

iCaps: Iterative Category-level Object Pose and Shape Estimation

This paper proposes a category-level 6D object pose and shape estimation approach iCaps, which allows tracking 6D poses of unseen objects in a category and estimating their 3D shapes. We develop a category-level auto-encoder network using…

Computer Vision and Pattern Recognition · Computer Science · 2022-01-04 · Xinke Deng, Junyi Geng, Timothy Bretl, Yu Xiang, Dieter Fox

Category-Agnostic 6D Pose Estimation with Conditional Neural Processes

We present a novel meta-learning approach for 6D pose estimation on unknown objects. In contrast to "instance-level" and "category-level" pose estimation methods, our algorithm learns object representation in a category-agnostic way,…

Computer Vision and Pattern Recognition · Computer Science · 2023-10-20 · Yumeng Li, Ning Gao, Hanna Ziesche, Gerhard Neumann

PoET: Pose Estimation Transformer for Single-View, Multi-Object 6D Pose Estimation

Accurate 6D object pose estimation is an important task for a variety of robotic applications such as grasping or localization. It is a challenging task due to object symmetries, clutter and occlusion, but it becomes more challenging when…

Computer Vision and Pattern Recognition · Computer Science · 2022-11-28 · Thomas Jantos, Mohamed Amin Hamdad, Wolfgang Granig, Stephan Weiss, Jan Steinbrener

ZebraPose: Coarse to Fine Surface Encoding for 6DoF Object Pose Estimation

Establishing correspondences from image to 3D has been a key task of 6DoF object pose estimation for a long time. To predict pose more accurately, deeply learned dense maps replaced sparse templates. Dense methods also improved pose…

Computer Vision and Pattern Recognition · Computer Science · 2022-03-31 · Yongzhi Su, Mahdi Saleh, Torben Fetzer, Jason Rambach, Nassir Navab, Benjamin Busam, Didier Stricker, Federico Tombari

SO(3)-Pose: SO(3)-Equivariance Learning for 6D Object Pose Estimation

6D pose estimation of rigid objects from RGB-D images is crucial for object grasping and manipulation in robotics. Although RGB channels and the depth (D) channel are often complementary, providing respectively the appearance and geometry…

Computer Vision and Pattern Recognition · Computer Science · 2022-08-18 · Haoran Pan, Jun Zhou, Yuanpeng Liu, Xuequan Lu, Weiming Wang, Xuefeng Yan, Mingqiang Wei

LEADER: Learning Reliable Local-to-Global Correspondences for LiDAR Relocalization

LiDAR relocalization has attracted increasing attention as it can deliver accurate 6-DoF pose estimation in complex 3D environments. Recent learning-based regression methods offer efficient solutions by directly predicting global poses…

Computer Vision and Pattern Recognition · Computer Science · 2026-04-14 · Jianshi Wu, Minghang Zhu, Dunqiang Liu, Wen Li, Sheng Ao, Siqi Shen, Chenglu Wen, Cheng Wang

Self-supervised Learning of Implicit Shape Representation with Dense Correspondence for Deformable Objects

Learning 3D shape representation with dense correspondence for deformable objects is a fundamental problem in computer vision. Existing approaches often need additional annotations of specific semantic domain, e.g., skeleton poses for human…

Computer Vision and Pattern Recognition · Computer Science · 2023-12-27 · Baowen Zhang, Jiahe Li, Xiaoming Deng, Yinda Zhang, Cuixia Ma, Hongan Wang

iPose: Instance-Aware 6D Pose Estimation of Partly Occluded Objects

We address the task of 6D pose estimation of known rigid objects from single input images in scenarios where the objects are partly occluded. Recent RGB-D-based methods are robust to moderate degrees of occlusion. For RGB inputs, no…

Computer Vision and Pattern Recognition · Computer Science · 2018-06-19 · Omid Hosseini Jafari, Siva Karthik Mustikovela, Karl Pertsch, Eric Brachmann, Carsten Rother

Shape Prior Deformation for Categorical 6D Object Pose and Size Estimation

We present a novel learning approach to recover the 6D poses and sizes of unseen object instances from an RGB-D image. To handle the intra-class shape variation, we propose a deep network to reconstruct the 3D object model by explicitly…

Computer Vision and Pattern Recognition · Computer Science · 2020-07-17 · Meng Tian, Marcelo H Ang, Gim Hee Lee