English
Related papers

Related papers: Video based Object 6D Pose Estimation using Transf…

200 papers

We introduce a simple yet effective algorithm that uses convolutional neural networks to directly estimate object poses from videos. Our approach leverages the temporal information from a video sequence, and is computationally efficient and…

Computer Vision and Pattern Recognition · Computer Science 2021-11-23 Apoorva Beedu , Zhile Ren , Varun Agrawal , Irfan Essa

Estimating the 6D object pose is an essential task in many applications. Due to the lack of depth information, existing RGB-based methods are sensitive to occlusion and illumination changes. How to extract and utilize the geometry features…

Computer Vision and Pattern Recognition · Computer Science 2024-04-24 Xiao Lin , Deming Wang , Guangliang Zhou , Chengju Liu , Qijun Chen

In the current state of 6D pose estimation, top-performing techniques depend on complex intermediate correspondences, specialized architectures, and non-end-to-end algorithms. In contrast, our research reframes the problem as a…

Computer Vision and Pattern Recognition · Computer Science 2023-11-30 Sebastian Stapf , Tobias Bauernfeind , Marco Riboldi

The attention mechanism provides a sequential prediction framework for learning spatial models with enhanced implicit temporal consistency. In this work, we show a systematic design (from 2D to 3D) for how conventional networks and other…

Computer Vision and Pattern Recognition · Computer Science 2021-03-05 Ruixu Liu , Ju Shen , He Wang , Chen Chen , Sen-ching Cheung , Vijayan K. Asari

While CNN-based models have made remarkable progress on human pose estimation, what spatial dependencies they capture to localize keypoints remains unclear. In this work, we propose a model called \textbf{TransPose}, which introduces…

Computer Vision and Pattern Recognition · Computer Science 2021-09-02 Sen Yang , Zhibin Quan , Mu Nie , Wankou Yang

6D pose estimation is the task of predicting the translation and orientation of objects in a given input image, which is a crucial prerequisite for many robotics and augmented reality applications. Lately, the Transformer Network…

Computer Vision and Pattern Recognition · Computer Science 2021-09-24 Arash Amini , Arul Selvam Periyasamy , Sven Behnke

Learning based 6D object pose estimation methods rely on computing large intermediate pose representations and/or iteratively refining an initial estimation with a slow render-compare pipeline. This paper introduces a novel method we call…

Computer Vision and Pattern Recognition · Computer Science 2022-10-24 Pedro Castro , Tae-Kyun Kim

As demand for robotics manipulation application increases, accurate vision-based 6D pose estimation becomes essential for autonomous operations. Convolutional Neural Networks (CNNs) based approaches for pose estimation have been previously…

Computer Vision and Pattern Recognition · Computer Science 2023-07-13 Mahmoud Abdulsalam , Nabil Aouf

In this paper we introduce EfficientPose, a new approach for 6D object pose estimation. Our method is highly accurate, efficient and scalable over a wide range of computational resources. Moreover, it can detect the 2D bounding box of…

Computer Vision and Pattern Recognition · Computer Science 2020-11-19 Yannick Bukschat , Marcus Vetter

We seek to extract a temporally consistent 6D pose trajectory of a manipulated object from an Internet instructional video. This is a challenging set-up for current 6D pose estimation methods due to uncontrolled capturing conditions, subtle…

Computer Vision and Pattern Recognition · Computer Science 2025-03-14 Georgy Ponimatkin , Martin Cífka , Tomáš Souček , Médéric Fourmy , Yann Labbé , Vladimir Petrik , Josef Sivic

In this paper, we introduce a novel single shot approach for 6D object pose estimation of rigid objects based on depth images. For this purpose, a fully convolutional neural network is employed, where the 3D input data is spatially…

Computer Vision and Pattern Recognition · Computer Science 2020-04-28 Kilian Kleeberger , Marco F. Huber

Real-time robotic grasping, supporting a subsequent precise object-in-hand operation task, is a priority target towards highly advanced autonomous systems. However, such an algorithm which can perform sufficiently-accurate grasping with…

Computer Vision and Pattern Recognition · Computer Science 2021-11-12 Tuan-Tang Le , Trung-Son Le , Yu-Ru Chen , Joel Vidal , Chyi-Yeu Lin

In this paper, we address the challenging task of estimating 6D object pose from a single RGB image. Motivated by the deep learning based object detection methods, we propose a concise and efficient network that integrate 6D object pose…

Computer Vision and Pattern Recognition · Computer Science 2020-02-21 Jianhan Mei , Henghui Ding , Xudong Jiang

6D object pose estimation is a crucial prerequisite for autonomous robot manipulation applications. The state-of-the-art models for pose estimation are convolutional neural network (CNN)-based. Lately, Transformers, an architecture…

Computer Vision and Pattern Recognition · Computer Science 2023-07-24 Arul Selvam Periyasamy , Arash Amini , Vladimir Tsaturyan , Sven Behnke

Accurate 6D object pose estimation from images is a key problem in object-centric scene understanding, enabling applications in robotics, augmented reality, and scene reconstruction. Despite recent advances, existing methods often produce…

Computer Vision and Pattern Recognition · Computer Science 2025-04-01 Martin Malenický , Martin Cífka , Médéric Fourmy , Louis Montaut , Justin Carpentier , Josef Sivic , Vladimir Petrik

We present RePOSE, a fast iterative refinement method for 6D object pose estimation. Prior methods perform refinement by feeding zoomed-in input and rendered RGB images into a CNN and directly regressing an update of a refined pose. Their…

Computer Vision and Pattern Recognition · Computer Science 2021-08-20 Shun Iwase , Xingyu Liu , Rawal Khirodkar , Rio Yokota , Kris M. Kitani

We propose a fast and accurate 6D object pose estimation from a RGB-D image. Our proposed method is template matching based and consists of three main technical components, PCOF-MOD (multimodal PCOF), balanced pose tree (BPT) and optimum…

Computer Vision and Pattern Recognition · Computer Science 2020-03-10 Yoshinori Konishi , Kosuke Hattori , Manabu Hashimoto

The most recent trend in estimating the 6D pose of rigid objects has been to train deep networks to either directly regress the pose from the image or to predict the 2D locations of 3D keypoints, from which the pose can be obtained using a…

Computer Vision and Pattern Recognition · Computer Science 2019-04-09 Yinlin Hu , Joachim Hugonot , Pascal Fua , Mathieu Salzmann

6D object pose estimation is a crucial prerequisite for autonomous robot manipulation applications. The state-of-the-art models for pose estimation are convolutional neural network (CNN)-based. Lately, Transformers, an architecture…

Computer Vision and Pattern Recognition · Computer Science 2022-05-06 Arash Amini , Arul Selvam Periyasamy , Sven Behnke

We introduce MegaPose, a method to estimate the 6D pose of novel objects, that is, objects unseen during training. At inference time, the method only assumes knowledge of (i) a region of interest displaying the object in the image and (ii)…

Computer Vision and Pattern Recognition · Computer Science 2022-12-15 Yann Labbé , Lucas Manuelli , Arsalan Mousavian , Stephen Tyree , Stan Birchfield , Jonathan Tremblay , Justin Carpentier , Mathieu Aubry , Dieter Fox , Josef Sivic
‹ Prev 1 2 3 10 Next ›