English

Manipulation-Oriented Object Perception in Clutter through Affordance Coordinate Frames

Robotics 2022-10-12 v4 Computer Vision and Pattern Recognition

Abstract

In order to enable robust operation in unstructured environments, robots should be able to generalize manipulation actions to novel object instances. For example, to pour and serve a drink, a robot should be able to recognize novel containers which afford the task. Most importantly, robots should be able to manipulate these novel containers to fulfill the task. To achieve this, we aim to provide robust and generalized perception of object affordances and their associated manipulation poses for reliable manipulation. In this work, we combine the notions of affordance and category-level pose, and introduce the Affordance Coordinate Frame (ACF). With ACF, we represent each object class in terms of individual affordance parts and the compatibility between them, where each part is associated with a part category-level pose for robot manipulation. In our experiments, we demonstrate that ACF outperforms state-of-the-art methods for object detection, as well as category-level pose estimation for object parts. We further demonstrate the applicability of ACF to robot manipulation tasks through experiments in a simulated environment.

Keywords

Cite

@article{arxiv.2010.08202,
  title  = {Manipulation-Oriented Object Perception in Clutter through Affordance Coordinate Frames},
  author = {Xiaotong Chen and Kaizhi Zheng and Zhen Zeng and Cameron Kisailus and Shreshtha Basu and James Cooney and Jana Pavlasek and Odest Chadwicke Jenkins},
  journal= {arXiv preprint arXiv:2010.08202},
  year   = {2022}
}

Comments

Humanoids 2022 paper. video link: https://www.youtube.com/watch?v=VkLI8m_3erU; https://www.youtube.com/watch?v=7P9_O9wveYk, github link: https://github.com/cxt98/ACF_perception

R2 v1 2026-06-23T19:23:46.785Z