In order to enable robust operation in unstructured environments, robots should be able to generalize manipulation actions to novel object instances. For example, to pour and serve a drink, a robot should be able to recognize novel containers which afford the task. Most importantly, robots should be able to manipulate these novel containers to fulfill the task. To achieve this, we aim to provide robust and generalized perception of object affordances and their associated manipulation poses for reliable manipulation. In this work, we combine the notions of affordance and category-level pose, and introduce the Affordance Coordinate Frame (ACF). With ACF, we represent each object class in terms of individual affordance parts and the compatibility between them, where each part is associated with a part category-level pose for robot manipulation. In our experiments, we demonstrate that ACF outperforms state-of-the-art methods for object detection, as well as category-level pose estimation for object parts. We further demonstrate the applicability of ACF to robot manipulation tasks through experiments in a simulated environment.
@article{arxiv.2010.08202,
title = {Manipulation-Oriented Object Perception in Clutter through Affordance Coordinate Frames},
author = {Xiaotong Chen and Kaizhi Zheng and Zhen Zeng and Cameron Kisailus and Shreshtha Basu and James Cooney and Jana Pavlasek and Odest Chadwicke Jenkins},
journal= {arXiv preprint arXiv:2010.08202},
year = {2022}
}
Comments
Humanoids 2022 paper. video link: https://www.youtube.com/watch?v=VkLI8m_3erU; https://www.youtube.com/watch?v=7P9_O9wveYk, github link: https://github.com/cxt98/ACF_perception