Related papers: Video based Object 6D Pose Estimation using Transf…

VideoPose: Estimating 6D object pose from videos

We introduce a simple yet effective algorithm that uses convolutional neural networks to directly estimate object poses from videos. Our approach leverages the temporal information from a video sequence, and is computationally efficient and…

Computer Vision and Pattern Recognition · Computer Science 2021-11-23 Apoorva Beedu , Zhile Ren , Varun Agrawal , Irfan Essa

TransPose: 6D Object Pose Estimation with Geometry-Aware Transformer

Estimating the 6D object pose is an essential task in many applications. Due to the lack of depth information, existing RGB-based methods are sensitive to occlusion and illumination changes. How to extract and utilize the geometry features…

Computer Vision and Pattern Recognition · Computer Science 2024-04-24 Xiao Lin , Deming Wang , Guangliang Zhou , Chengju Liu , Qijun Chen

PViT-6D: Overclocking Vision Transformers for 6D Pose Estimation with Confidence-Level Prediction and Pose Tokens

In the current state of 6D pose estimation, top-performing techniques depend on complex intermediate correspondences, specialized architectures, and non-end-to-end algorithms. In contrast, our research reframes the problem as a…

Computer Vision and Pattern Recognition · Computer Science 2023-11-30 Sebastian Stapf , Tobias Bauernfeind , Marco Riboldi

Enhanced 3D Human Pose Estimation from Videos by using Attention-Based Neural Network with Dilated Convolutions

The attention mechanism provides a sequential prediction framework for learning spatial models with enhanced implicit temporal consistency. In this work, we show a systematic design (from 2D to 3D) for how conventional networks and other…

Computer Vision and Pattern Recognition · Computer Science 2021-03-05 Ruixu Liu , Ju Shen , He Wang , Chen Chen , Sen-ching Cheung , Vijayan K. Asari

TransPose: Keypoint Localization via Transformer

While CNN-based models have made remarkable progress on human pose estimation, what spatial dependencies they capture to localize keypoints remains unclear. In this work, we propose a model called \textbf{TransPose}, which introduces…

Computer Vision and Pattern Recognition · Computer Science 2021-09-02 Sen Yang , Zhibin Quan , Mu Nie , Wankou Yang

T6D-Direct: Transformers for Multi-Object 6D Pose Direct Regression

6D pose estimation is the task of predicting the translation and orientation of objects in a given input image, which is a crucial prerequisite for many robotics and augmented reality applications. Lately, the Transformer Network…

Computer Vision and Pattern Recognition · Computer Science 2021-09-24 Arash Amini , Arul Selvam Periyasamy , Sven Behnke

CRT-6D: Fast 6D Object Pose Estimation with Cascaded Refinement Transformers

Learning based 6D object pose estimation methods rely on computing large intermediate pose representations and/or iteratively refining an initial estimation with a slow render-compare pipeline. This paper introduces a novel method we call…

Computer Vision and Pattern Recognition · Computer Science 2022-10-24 Pedro Castro , Tae-Kyun Kim

TransPose: A Transformer-based 6D Object Pose Estimation Network with Depth Refinement

As demand for robotics manipulation application increases, accurate vision-based 6D pose estimation becomes essential for autonomous operations. Convolutional Neural Networks (CNNs) based approaches for pose estimation have been previously…

Computer Vision and Pattern Recognition · Computer Science 2023-07-13 Mahmoud Abdulsalam , Nabil Aouf

EfficientPose: An efficient, accurate and scalable end-to-end 6D multi object pose estimation approach

In this paper we introduce EfficientPose, a new approach for 6D object pose estimation. Our method is highly accurate, efficient and scalable over a wide range of computational resources. Moreover, it can detect the 2D bounding box of…

Computer Vision and Pattern Recognition · Computer Science 2020-11-19 Yannick Bukschat , Marcus Vetter

6D Object Pose Tracking in Internet Videos for Robotic Manipulation

We seek to extract a temporally consistent 6D pose trajectory of a manipulated object from an Internet instructional video. This is a challenging set-up for current 6D pose estimation methods due to uncontrolled capturing conditions, subtle…

Computer Vision and Pattern Recognition · Computer Science 2025-03-14 Georgy Ponimatkin , Martin Cífka , Tomáš Souček , Médéric Fourmy , Yann Labbé , Vladimir Petrik , Josef Sivic

Single Shot 6D Object Pose Estimation

In this paper, we introduce a novel single shot approach for 6D object pose estimation of rigid objects based on depth images. For this purpose, a fully convolutional neural network is employed, where the 3D input data is spatially…

Computer Vision and Pattern Recognition · Computer Science 2020-04-28 Kilian Kleeberger , Marco F. Huber

6D Pose Estimation with Combined Deep Learning and 3D Vision Techniques for a Fast and Accurate Object Grasping

Real-time robotic grasping, supporting a subsequent precise object-in-hand operation task, is a priority target towards highly advanced autonomous systems. However, such an algorithm which can perform sufficiently-accurate grasping with…

Computer Vision and Pattern Recognition · Computer Science 2021-11-12 Tuan-Tang Le , Trung-Son Le , Yu-Ru Chen , Joel Vidal , Chyi-Yeu Lin

Object 6D Pose Estimation with Non-local Attention

In this paper, we address the challenging task of estimating 6D object pose from a single RGB image. Motivated by the deep learning based object detection methods, we propose a concise and efficient network that integrate 6D object pose…

Computer Vision and Pattern Recognition · Computer Science 2020-02-21 Jianhan Mei , Henghui Ding , Xudong Jiang

YOLOPose V2: Understanding and Improving Transformer-based 6D Pose Estimation

6D object pose estimation is a crucial prerequisite for autonomous robot manipulation applications. The state-of-the-art models for pose estimation are convolutional neural network (CNN)-based. Lately, Transformers, an architecture…

Computer Vision and Pattern Recognition · Computer Science 2023-07-24 Arul Selvam Periyasamy , Arash Amini , Vladimir Tsaturyan , Sven Behnke

PhysPose: Refining 6D Object Poses with Physical Constraints

Accurate 6D object pose estimation from images is a key problem in object-centric scene understanding, enabling applications in robotics, augmented reality, and scene reconstruction. Despite recent advances, existing methods often produce…

Computer Vision and Pattern Recognition · Computer Science 2025-04-01 Martin Malenický , Martin Cífka , Médéric Fourmy , Louis Montaut , Justin Carpentier , Josef Sivic , Vladimir Petrik

RePOSE: Fast 6D Object Pose Refinement via Deep Texture Rendering

We present RePOSE, a fast iterative refinement method for 6D object pose estimation. Prior methods perform refinement by feeding zoomed-in input and rendered RGB images into a CNN and directly regressing an update of a refined pose. Their…

Computer Vision and Pattern Recognition · Computer Science 2021-08-20 Shun Iwase , Xingyu Liu , Rawal Khirodkar , Rio Yokota , Kris M. Kitani

Real-Time 6D Object Pose Estimation on CPU

We propose a fast and accurate 6D object pose estimation from a RGB-D image. Our proposed method is template matching based and consists of three main technical components, PCOF-MOD (multimodal PCOF), balanced pose tree (BPT) and optimum…

Computer Vision and Pattern Recognition · Computer Science 2020-03-10 Yoshinori Konishi , Kosuke Hattori , Manabu Hashimoto

Segmentation-driven 6D Object Pose Estimation

The most recent trend in estimating the 6D pose of rigid objects has been to train deep networks to either directly regress the pose from the image or to predict the 2D locations of 3D keypoints, from which the pose can be obtained using a…

Computer Vision and Pattern Recognition · Computer Science 2019-04-09 Yinlin Hu , Joachim Hugonot , Pascal Fua , Mathieu Salzmann

YOLOPose: Transformer-based Multi-Object 6D Pose Estimation using Keypoint Regression

6D object pose estimation is a crucial prerequisite for autonomous robot manipulation applications. The state-of-the-art models for pose estimation are convolutional neural network (CNN)-based. Lately, Transformers, an architecture…

Computer Vision and Pattern Recognition · Computer Science 2022-05-06 Arash Amini , Arul Selvam Periyasamy , Sven Behnke

MegaPose: 6D Pose Estimation of Novel Objects via Render & Compare

We introduce MegaPose, a method to estimate the 6D pose of novel objects, that is, objects unseen during training. At inference time, the method only assumes knowledge of (i) a region of interest displaying the object in the image and (ii)…

Computer Vision and Pattern Recognition · Computer Science 2022-12-15 Yann Labbé , Lucas Manuelli , Arsalan Mousavian , Stephen Tyree , Stan Birchfield , Jonathan Tremblay , Justin Carpentier , Mathieu Aubry , Dieter Fox , Josef Sivic