Related papers: Multi-path Learning for Object Pose Estimation Acr…

Scene-level Pose Estimation for Multiple Instances of Densely Packed Objects

This paper introduces key machine learning operations that allow the realization of robust, joint 6D pose estimation of multiple instances of objects either densely packed or in unstructured piles from RGB-D data. The first objective is to…

Robotics · Computer Science 2019-10-14 Chaitanya Mitash, Bowen Wen, Kostas Bekris, Abdeslam Boularias

Multi-view object pose estimation from correspondence distributions and epipolar geometry

In many automation tasks involving manipulation of rigid objects, the poses of the objects must be acquired. Vision-based pose estimation using a single RGB or RGB-D sensor is especially popular due to its broad applicability. However,…

Computer Vision and Pattern Recognition · Computer Science 2023-03-24 Rasmus Laurvig Haugaard, Thorbjørn Mosekjær Iversen

Implicit 3D Orientation Learning for 6D Object Detection from RGB Images

We propose a real-time RGB-based pipeline for object detection and 6D pose estimation. Our novel 3D orientation estimation is based on a variant of the Denoising Autoencoder that is trained on simulated views of a 3D model using Domain…

Computer Vision and Pattern Recognition · Computer Science 2019-07-18 Martin Sundermeyer, Zoltan-Csaba Marton, Maximilian Durner, Manuel Brucker, Rudolph Triebel

A Unified Framework for Multi-View Multi-Class Object Pose Estimation

One core challenge in object pose estimation is to ensure accurate and robust performance for large numbers of diverse foreground objects amidst complex background clutter. In this work, we present a scalable framework for accurately…

Computer Vision and Pattern Recognition · Computer Science 2018-10-09 Chi Li, Jin Bai, Gregory D. Hager

CASAPose: Class-Adaptive and Semantic-Aware Multi-Object Pose Estimation

Applications in the field of augmented reality or robotics often require joint localisation and 6D pose estimation of multiple objects. However, most algorithms need one network per object class to be trained in order to provide the best…

Computer Vision and Pattern Recognition · Computer Science 2022-12-12 Niklas Gard, Anna Hilsmann, Peter Eisert

6D Pose Estimation for Textureless Objects on RGB Frames using Multi-View Optimization

6D pose estimation of textureless objects is a valuable but challenging task for many robotic applications. In this work, we propose a framework to address this challenge using only RGB images acquired from multiple viewpoints. The core…

Robotics · Computer Science 2023-02-23 Jun Yang, Wenjie Xue, Sahar Ghavidel, Steven L. Waslander

Category-Agnostic 6D Pose Estimation with Conditional Neural Processes

We present a novel meta-learning approach for 6D pose estimation on unknown objects. In contrast to "instance-level" and "category-level" pose estimation methods, our algorithm learns object representation in a category-agnostic way,…

Computer Vision and Pattern Recognition · Computer Science 2023-10-20 Yumeng Li, Ning Gao, Hanna Ziesche, Gerhard Neumann

Object Pose Transformer: Unifying Unseen Object Pose Estimation

Learning model-free object pose estimation for unseen instances remains a fundamental challenge in 3D vision. Existing methods typically fall into two disjoint paradigms: category-level approaches predict absolute poses in a canonical space…

Computer Vision and Pattern Recognition · Computer Science 2026-03-25 Weihang Li, Lorenzo Garattoni, Fabien Despinoy, Nassir Navab, Benjamin Busam

DualPoseNet: Category-level 6D Object Pose and Size Estimation Using Dual Pose Network with Refined Learning of Pose Consistency

Category-level 6D object pose and size estimation is to predict full pose configurations of rotation, translation, and size for object instances observed in single, arbitrary views of cluttered scenes. In this paper, we propose a new method…

Computer Vision and Pattern Recognition · Computer Science 2021-08-17 Jiehong Lin, Zewei Wei, Zhihao Li, Songcen Xu, Kui Jia, Yuanqing Li

Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference

Autonomous driving systems require a comprehensive understanding of the environment, achieved by extracting visual features essential for perception, planning, and control. However, models trained solely on single-task objectives or generic…

Computer Vision and Pattern Recognition · Computer Science 2026-04-03 Huy-Dung Nguyen, Anass Bairouk, Mirjana Maras, Wei Xiao, Tsun-Hsuan Wang, Patrick Chareyre, Ramin Hasani, Marc Blanchon, Daniela Rus

OVE6D: Object Viewpoint Encoding for Depth-based 6D Object Pose Estimation

This paper proposes a universal framework, called OVE6D, for model-based 6D object pose estimation from a single depth image and a target object mask. Our model is trained using purely synthetic data rendered from ShapeNet, and, unlike most…

Computer Vision and Pattern Recognition · Computer Science 2022-04-11 Dingding Cai, Janne Heikkilä, Esa Rahtu

ShapeCodes: Self-Supervised Feature Learning by Lifting Views to Viewgrids

We introduce an unsupervised feature learning approach that embeds 3D shape information into a single-view image representation. The main idea is a self-supervised training objective that, given only a single 2D image, requires all unseen…

Computer Vision and Pattern Recognition · Computer Science 2018-08-01 Dinesh Jayaraman, Ruohan Gao, Kristen Grauman

Pixel-Semantic Revise of Position Learning A One-Stage Object Detector with A Shared Encoder-Decoder

Recently, many methods have been proposed for object detection. They cannot detect objects by semantic features, adaptively. In this work, according to channel and spatial attention mechanisms, we mainly analyze that different methods…

Computer Vision and Pattern Recognition · Computer Science 2020-09-30 Qian Li, Nan Guo, Xiaochun Ye, Dongrui Fan, Zhimin Tang

Unsupervised Geometry-Aware Representation for 3D Human Pose Estimation

Modern 3D human pose estimation techniques rely on deep networks, which require large amounts of training data. While weakly-supervised methods require less supervision, by utilizing 2D poses or multi-view imagery without annotations, they…

Computer Vision and Pattern Recognition · Computer Science 2018-04-05 Helge Rhodin, Mathieu Salzmann, Pascal Fua

Learning Object Localization and 6D Pose Estimation from Simulation and Weakly Labeled Real Images

This work proposes a process for efficiently training a point-wise object detector that enables localizing objects and computing their 6D poses in cluttered and occluded scenes. Accurate pose estimation is typically a requirement for robust…

Computer Vision and Pattern Recognition · Computer Science 2019-02-22 Jean-Philippe Mercier, Chaitanya Mitash, Philippe Giguère, Abdeslam Boularias

Unifying Visual Perception by Dispersible Points Learning

We present a conceptually simple, flexible, and universal visual perception head for variant visual tasks, e.g., classification, object detection, instance segmentation and pose estimation, and different frameworks, such as one-stage or…

Computer Vision and Pattern Recognition · Computer Science 2022-09-13 Jianming Liang, Guanglu Song, Biao Leng, Yu Liu

Object Pose Estimation from Monocular Image using Multi-View Keypoint Correspondence

Understanding the geometry and pose of objects in 2D images is a fundamental necessity for a wide range of real world applications. Driven by deep neural networks, recent methods have brought significant improvements to object pose…

Computer Vision and Pattern Recognition · Computer Science 2018-09-05 Jogendra Nath Kundu, Rahul M. V., Aditya Ganeshan, R. Venkatesh Babu

RotationNet: Joint Object Categorization and Pose Estimation Using Multiviews from Unsupervised Viewpoints

We propose a Convolutional Neural Network (CNN)-based model "RotationNet," which takes multi-view images of an object as input and jointly estimates its pose and object category. Unlike previous approaches that use known viewpoint labels…

Computer Vision and Pattern Recognition · Computer Science 2018-03-26 Asako Kanezaki, Yasuyuki Matsushita, Yoshifumi Nishida

Towards Generalizable Multi-Camera 3D Object Detection via Perspective Debiasing

Detecting objects in 3D space using multiple cameras, known as Multi-Camera 3D Object Detection (MC3D-Det), has gained prominence with the advent of bird's-eye view (BEV) approaches. However, these methods often struggle when faced with…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Hao Lu, Yunpeng Zhang, Qing Lian, Dalong Du, Yingcong Chen

One2Any: One-Reference 6D Pose Estimation for Any Object

6D object pose estimation remains challenging for many applications due to dependencies on complete 3D models, multi-view images, or training limited to specific object categories. These requirements make generalization to novel objects…

Computer Vision and Pattern Recognition · Computer Science 2025-05-08 Mengya Liu, Siyuan Li, Ajad Chhatkuli, Prune Truong, Luc Van Gool, Federico Tombari