Related papers: Leveraging Positional Encoding for Robust Multi-Re…

Learnable Fourier Features for Multi-Dimensional Spatial Positional Encoding

Attentional mechanisms are order-invariant. Positional encoding is a crucial component to allow attention-based deep model architectures such as Transformer to address sequences or images where the position of information matters. In this…

Machine Learning · Computer Science 2021-11-10 Yang Li, Si Si, Gang Li, Cho-Jui Hsieh, Samy Bengio

Learning Regularized Positional Encoding for Molecular Prediction

Machine learning has become a promising approach for molecular modeling. Positional quantities, such as interatomic distances and bond angles, play a crucial role in molecule physics. The existing works rely on careful manual design of…

Machine Learning · Computer Science 2022-11-24 Xiang Gao, Weihao Gao, Wenzhi Xiao, Zhirui Wang, Chong Wang, Liang Xiang

Spatial Attention Improves Iterative 6D Object Pose Estimation

The task of estimating the 6D pose of an object from RGB images can be broken down into two main steps: an initial pose estimation step, followed by a refinement procedure to correctly register the object and its observation. In this paper,…

Computer Vision and Pattern Recognition · Computer Science 2021-01-06 Stefan Stevsic, Otmar Hilliges

RNNPose: Recurrent 6-DoF Object Pose Refinement with Robust Correspondence Field Estimation and Pose Optimization

6-DoF object pose estimation from a monocular image is challenging, and a post-refinement procedure is generally needed for high-precision estimation. In this paper, we propose a framework based on a recurrent neural network (RNN) for…

Computer Vision and Pattern Recognition · Computer Science 2022-04-12 Yan Xu, Kwan-Yee Lin, Guofeng Zhang, Xiaogang Wang, Hongsheng Li

DPOD: 6D Pose Object Detector and Refiner

In this paper we present a novel deep learning method for 3D object detection and 6D pose estimation from RGB images. Our method, named DPOD (Dense Pose Object Detector), estimates dense multi-class 2D-3D correspondence maps between an…

Computer Vision and Pattern Recognition · Computer Science 2019-08-21 Sergey Zakharov, Ivan Shugurov, Slobodan Ilic

ZebraPose: Coarse to Fine Surface Encoding for 6DoF Object Pose Estimation

Establishing correspondences from image to 3D has been a key task of 6DoF object pose estimation for a long time. To predict pose more accurately, deeply learned dense maps replaced sparse templates. Dense methods also improved pose…

Computer Vision and Pattern Recognition · Computer Science 2022-03-31 Yongzhi Su, Mahdi Saleh, Torben Fetzer, Jason Rambach, Nassir Navab, Benjamin Busam, Didier Stricker, Federico Tombari

Robust 6D Object Pose Estimation by Learning RGB-D Features

Accurate 6D object pose estimation is fundamental to robotic manipulation and grasping. Previous methods follow a local optimization approach which minimizes the distance between closest point pairs to handle the rotation ambiguity of…

Computer Vision and Pattern Recognition · Computer Science 2020-03-10 Meng Tian, Liang Pan, Marcelo H Ang, Gim Hee Lee

Pose Proposal Critic: Robust Pose Refinement by Learning Reprojection Errors

In recent years, considerable progress has been made for the task of rigid object pose estimation from a single RGB-image, but achieving robustness to partial occlusions remains a challenging problem. Pose refinement via rendering has shown…

Computer Vision and Pattern Recognition · Computer Science 2020-05-15 Lucas Brynte, Fredrik Kahl

Robust Camera Pose Refinement for Multi-Resolution Hash Encoding

Multi-resolution hash encoding has recently been proposed to reduce the computational cost of neural renderings, such as NeRF. This method requires accurate camera poses for the neural renderings of given scenes. However, contrary to…

Computer Vision and Pattern Recognition · Computer Science 2023-02-06 Hwan Heo, Taekyung Kim, Jiyoung Lee, Jaewon Lee, Soohyun Kim, Hyunwoo J. Kim, Jin-Hwa Kim

RePOSE: Fast 6D Object Pose Refinement via Deep Texture Rendering

We present RePOSE, a fast iterative refinement method for 6D object pose estimation. Prior methods perform refinement by feeding zoomed-in input and rendered RGB images into a CNN and directly regressing an update of a refined pose. Their…

Computer Vision and Pattern Recognition · Computer Science 2021-08-20 Shun Iwase, Xingyu Liu, Rawal Khirodkar, Rio Yokota, Kris M. Kitani

WALDO: Where Unseen Model-based 6D Pose Estimation Meets Occlusion

Accurate 6D object pose estimation is vital for robotics, augmented reality, and scene understanding. For seen objects, high accuracy is often attainable via per-object fine-tuning but generalizing to unseen objects remains a challenge. To…

Computer Vision and Pattern Recognition · Computer Science 2025-11-21 Sajjad Pakdamansavoji, Yintao Ma, Amir Rasouli, Tongtong Cao

Making Deep Heatmaps Robust to Partial Occlusions for 3D Object Pose Estimation

We introduce a novel method for robust and accurate 3D object pose estimation from a single color image under large occlusions. Following recent approaches, we first predict the 2D projections of 3D points related to the target object and…

Computer Vision and Pattern Recognition · Computer Science 2018-07-27 Markus Oberweger, Mahdi Rad, Vincent Lepetit

NeRF-Pose: A First-Reconstruct-Then-Regress Approach for Weakly-supervised 6D Object Pose Estimation

Pose estimation of 3D objects in monocular images is a fundamental and long-standing problem in computer vision. Existing deep learning approaches for 6D pose estimation typically rely on the assumption of availability of 3D object models…

Computer Vision and Pattern Recognition · Computer Science 2023-09-12 Fu Li, Hao Yu, Ivan Shugurov, Benjamin Busam, Shaowu Yang, Slobodan Ilic

Refining 6D Object Pose Predictions using Abstract Render-and-Compare

Robotic systems often require precise scene analysis capabilities, especially in unstructured, cluttered situations, as occurring in human-made environments. While current deep-learning based methods yield good estimates of object poses,…

Computer Vision and Pattern Recognition · Computer Science 2019-10-09 Arul Selvam Periyasamy, Max Schwarz, Sven Behnke

Review on 6D Object Pose Estimation with the focus on Indoor Scene Understanding

6D object pose estimation problem has been extensively studied in the field of Computer Vision and Robotics. It has wide range of applications such as robot manipulation, augmented reality, and 3D scene understanding. With the advent of…

Computer Vision and Pattern Recognition · Computer Science 2023-04-13 Negar Nejatishahidin, Pooya Fayyazsanavi

Introduction to Camera Pose Estimation with Deep Learning

Over the last two decades, deep learning has transformed the field of computer vision. Deep convolutional networks were successfully applied to learn different vision tasks such as image classification, image segmentation, object detection…

Computer Vision and Pattern Recognition · Computer Science 2019-07-17 Yoli Shavit, Ron Ferens

CoordAR: One-Reference 6D Pose Estimation of Novel Objects via Autoregressive Coordinate Map Generation

Object 6D pose estimation, a crucial task for robotics and augmented reality applications, becomes particularly challenging when dealing with novel objects whose 3D models are not readily available. To reduce dependency on 3D models, recent…

Computer Vision and Pattern Recognition · Computer Science 2025-12-16 Dexin Zuo, Ang Li, Wei Wang, Wenxian Yu, Danping Zou

Improving Robustness and Accuracy via Relative Information Encoding in 3D Human Pose Estimation

Most of the existing 3D human pose estimation approaches mainly focus on predicting 3D positional relationships between the root joint and other human joints (local motion) instead of the overall trajectory of the human body (global…

Computer Vision and Pattern Recognition · Computer Science 2021-07-30 Wenkang Shan, Haopeng Lu, Shanshe Wang, Xinfeng Zhang, Wen Gao

Improving Machine Learning-Based Robot Self-Collision Checking with Input Positional Encoding

This manuscript investigates the integration of positional encoding -- a technique widely used in computer graphics -- into the input vector of a binary classification model for self-collision detection. The results demonstrate the benefits…

Robotics · Computer Science 2026-04-21 Bartłomiej Kulecki, Dominik Belter

ConvPoseCNN2: Prediction and Refinement of Dense 6D Object Poses

Object pose estimation is a key perceptual capability in robotics. We propose a fully-convolutional extension of the PoseCNN method, which densely predicts object translations and orientations. This has several advantages such as improving…

Computer Vision and Pattern Recognition · Computer Science 2022-05-24 Arul Selvam Periyasamy, Catherine Capellen, Max Schwarz, Sven Behnke