SABER-6D: Shape Representation Based Implicit Object Pose Estimation

Computer Vision and Pattern Recognition 2024-09-04 v2

Abstract

In this paper, we propose a novel encoder-decoder architecture, named SABER, that learns the 6D pose of an object in an embedding space by learning a shape representation at a given pose. The model learns pose by representing the object's shape at a target pose from RGB image input. Shape representation serves as an auxiliary task that helps the model learn the rotation space of an object from 2D images. An image encoder predicts the rotation in the embedding space, and a DeepSDF-based decoder learns to represent the object's shape at the given pose. Because the approach is shape based, the pipeline is suitable for any type of object, irrespective of symmetry. Moreover, only a CAD model of each object is needed to train SABER. The pipeline is trained on synthetic data and handles symmetric objects without symmetry labels, so no additional labeled training data is needed. The experimental evaluation shows that our method achieves close to benchmark results for both symmetric and asymmetric objects on the Occlusion-LineMOD and T-LESS datasets.
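The claim that shape-based supervision needs no symmetry labels can be illustrated with a small sketch. It is not the SABER network: an analytic box signed distance function stands in for the learned DeepSDF-based decoder, and all names (`box_sdf`, `posed_sdf`, `shape_discrepancy`) are hypothetical. The sketch assumes only that the shape at a rotation R is obtained by evaluating the canonical SDF at R^T p, and shows that two rotations related by an object symmetry give the same shape, so a shape-level comparison cannot penalize one relative to the other.

```python
import math
import random


def box_sdf(p, half_extents):
    """Signed distance from point p to an axis-aligned box centred at the origin."""
    q = [abs(c) - h for c, h in zip(p, half_extents)]
    outside = math.sqrt(sum(max(c, 0.0) ** 2 for c in q))
    inside = min(max(q), 0.0)
    return outside + inside


def rot_z(theta):
    """3x3 rotation matrix about the z axis."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


def apply_transpose(R, p):
    """Return R^T p, mapping a query point back into the canonical object frame."""
    return [sum(R[k][i] * p[k] for k in range(3)) for i in range(3)]


def posed_sdf(p, R, half_extents):
    """SDF of the box rotated by R: the canonical SDF evaluated at R^T p."""
    return box_sdf(apply_transpose(R, p), half_extents)


def shape_discrepancy(R_a, R_b, half_extents, points):
    """Mean absolute SDF difference between the shape at pose R_a and at pose R_b."""
    total = sum(
        abs(posed_sdf(p, R_a, half_extents) - posed_sdf(p, R_b, half_extents))
        for p in points
    )
    return total / len(points)


# Illustrative query points and box size (assumed values, not from the paper).
rng = random.Random(0)
points = [[rng.uniform(-1.0, 1.0) for _ in range(3)] for _ in range(500)]
half_extents = (0.5, 0.3, 0.2)

R_ref = rot_z(0.4)
R_sym = rot_z(0.4 + math.pi)        # differs from R_ref by a 180 degree symmetry of the box
R_other = rot_z(0.4 + math.pi / 3)  # not related to R_ref by any symmetry of the box

sym_gap = shape_discrepancy(R_ref, R_sym, half_extents, points)
other_gap = shape_discrepancy(R_ref, R_other, half_extents, points)
print(f"symmetric pose gap: {sym_gap:.2e}, non-symmetric pose gap: {other_gap:.2e}")
```

In this toy setting the discrepancy between the two symmetry-equivalent poses vanishes up to floating-point error, while the unrelated pose yields a clearly nonzero value. A rotation-space loss would instead treat the two equivalent poses as 180 degrees apart unless symmetry labels were supplied.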

Cite

@article{arxiv.2408.05867,
  title  = {SABER-6D: Shape Representation Based Implicit Object Pose Estimation},
  author = {Shishir Reddy Vutukur and Mengkejiergeli Ba and Benjamin Busam and Matthias Kayser and Gurprit Singh},
  journal= {arXiv preprint arXiv:2408.05867},
  year   = {2024}
}

Comments

ECCV 2024 R6D workshop
