Related papers

Related papers: Leveraging Positional Encoding for Robust Multi-Re…

200 papers

Attentional mechanisms are order-invariant. Positional encoding is a crucial component to allow attention-based deep model architectures such as Transformer to address sequences or images where the position of information matters. In this…

Machine Learning · Computer Science · 2021-11-10 · Yang Li, Si Si, Gang Li, Cho-Jui Hsieh, Samy Bengio

Machine learning has become a promising approach for molecular modeling. Positional quantities, such as interatomic distances and bond angles, play a crucial role in molecule physics. The existing works rely on careful manual design of…

Machine Learning · Computer Science · 2022-11-24 · Xiang Gao, Weihao Gao, Wenzhi Xiao, Zhirui Wang, Chong Wang, Liang Xiang

The task of estimating the 6D pose of an object from RGB images can be broken down into two main steps: an initial pose estimation step, followed by a refinement procedure to correctly register the object and its observation. In this paper,…

Computer Vision and Pattern Recognition · Computer Science · 2021-01-06 · Stefan Stevsic, Otmar Hilliges

6-DoF object pose estimation from a monocular image is challenging, and a post-refinement procedure is generally needed for high-precision estimation. In this paper, we propose a framework based on a recurrent neural network (RNN) for…

Computer Vision and Pattern Recognition · Computer Science · 2022-04-12 · Yan Xu, Kwan-Yee Lin, Guofeng Zhang, Xiaogang Wang, Hongsheng Li

In this paper we present a novel deep learning method for 3D object detection and 6D pose estimation from RGB images. Our method, named DPOD (Dense Pose Object Detector), estimates dense multi-class 2D-3D correspondence maps between an…

Computer Vision and Pattern Recognition · Computer Science · 2019-08-21 · Sergey Zakharov, Ivan Shugurov, Slobodan Ilic

Establishing correspondences from image to 3D has been a key task of 6DoF object pose estimation for a long time. To predict pose more accurately, deeply learned dense maps replaced sparse templates. Dense methods also improved pose…

Computer Vision and Pattern Recognition · Computer Science · 2022-03-31 · Yongzhi Su, Mahdi Saleh, Torben Fetzer, Jason Rambach, Nassir Navab, Benjamin Busam, Didier Stricker, Federico Tombari

Accurate 6D object pose estimation is fundamental to robotic manipulation and grasping. Previous methods follow a local optimization approach which minimizes the distance between closest point pairs to handle the rotation ambiguity of…

Computer Vision and Pattern Recognition · Computer Science · 2020-03-10 · Meng Tian, Liang Pan, Marcelo H Ang, Gim Hee Lee

In recent years, considerable progress has been made for the task of rigid object pose estimation from a single RGB-image, but achieving robustness to partial occlusions remains a challenging problem. Pose refinement via rendering has shown…

Computer Vision and Pattern Recognition · Computer Science · 2020-05-15 · Lucas Brynte, Fredrik Kahl

Multi-resolution hash encoding has recently been proposed to reduce the computational cost of neural renderings, such as NeRF. This method requires accurate camera poses for the neural renderings of given scenes. However, contrary to…

Computer Vision and Pattern Recognition · Computer Science · 2023-02-06 · Hwan Heo, Taekyung Kim, Jiyoung Lee, Jaewon Lee, Soohyun Kim, Hyunwoo J. Kim, Jin-Hwa Kim

We present RePOSE, a fast iterative refinement method for 6D object pose estimation. Prior methods perform refinement by feeding zoomed-in input and rendered RGB images into a CNN and directly regressing an update of a refined pose. Their…

Computer Vision and Pattern Recognition · Computer Science · 2021-08-20 · Shun Iwase, Xingyu Liu, Rawal Khirodkar, Rio Yokota, Kris M. Kitani

Accurate 6D object pose estimation is vital for robotics, augmented reality, and scene understanding. For seen objects, high accuracy is often attainable via per-object fine-tuning but generalizing to unseen objects remains a challenge. To…

Computer Vision and Pattern Recognition · Computer Science · 2025-11-21 · Sajjad Pakdamansavoji, Yintao Ma, Amir Rasouli, Tongtong Cao

We introduce a novel method for robust and accurate 3D object pose estimation from a single color image under large occlusions. Following recent approaches, we first predict the 2D projections of 3D points related to the target object and…

Computer Vision and Pattern Recognition · Computer Science · 2018-07-27 · Markus Oberweger, Mahdi Rad, Vincent Lepetit

Pose estimation of 3D objects in monocular images is a fundamental and long-standing problem in computer vision. Existing deep learning approaches for 6D pose estimation typically rely on the assumption of availability of 3D object models…

Computer Vision and Pattern Recognition · Computer Science · 2023-09-12 · Fu Li, Hao Yu, Ivan Shugurov, Benjamin Busam, Shaowu Yang, Slobodan Ilic

Robotic systems often require precise scene analysis capabilities, especially in unstructured, cluttered situations such as those occurring in human-made environments. While current deep-learning-based methods yield good estimates of object poses,…

Computer Vision and Pattern Recognition · Computer Science · 2019-10-09 · Arul Selvam Periyasamy, Max Schwarz, Sven Behnke

The 6D object pose estimation problem has been extensively studied in the fields of computer vision and robotics. It has a wide range of applications, such as robot manipulation, augmented reality, and 3D scene understanding. With the advent of…

Computer Vision and Pattern Recognition · Computer Science · 2023-04-13 · Negar Nejatishahidin, Pooya Fayyazsanavi

Over the last two decades, deep learning has transformed the field of computer vision. Deep convolutional networks were successfully applied to learn different vision tasks such as image classification, image segmentation, object detection…

Computer Vision and Pattern Recognition · Computer Science · 2019-07-17 · Yoli Shavit, Ron Ferens

Object 6D pose estimation, a crucial task for robotics and augmented reality applications, becomes particularly challenging when dealing with novel objects whose 3D models are not readily available. To reduce dependency on 3D models, recent…

Computer Vision and Pattern Recognition · Computer Science · 2025-12-16 · Dexin Zuo, Ang Li, Wei Wang, Wenxian Yu, Danping Zou

Most of the existing 3D human pose estimation approaches mainly focus on predicting 3D positional relationships between the root joint and other human joints (local motion) instead of the overall trajectory of the human body (global…

Computer Vision and Pattern Recognition · Computer Science · 2021-07-30 · Wenkang Shan, Haopeng Lu, Shanshe Wang, Xinfeng Zhang, Wen Gao

This manuscript investigates the integration of positional encoding, a technique widely used in computer graphics, into the input vector of a binary classification model for self-collision detection. The results demonstrate the benefits…

Robotics · Computer Science · 2026-04-21 · Bartłomiej Kulecki, Dominik Belter

Object pose estimation is a key perceptual capability in robotics. We propose a fully-convolutional extension of the PoseCNN method, which densely predicts object translations and orientations. This has several advantages such as improving…

Computer Vision and Pattern Recognition · Computer Science · 2022-05-24 · Arul Selvam Periyasamy, Catherine Capellen, Max Schwarz, Sven Behnke