Related papers: Human-oriented Representation Learning for Robotic…

Manipulate by Seeing: Creating Manipulation Controllers from Pre-Trained Representations

The field of visual representation learning has seen explosive growth in the past years, but its benefits in robotics have been surprisingly limited so far. Prior work uses generic visual representations as a basis to learn (task-specific)…

Robotics · Computer Science 2023-08-16 Jianren Wang, Sudeep Dasari, Mohan Kumar Srirama, Shubham Tulsiani, Abhinav Gupta

Object-Centric Action-Enhanced Representations for Robot Visuo-Motor Policy Learning

Learning visual representations from observing actions to benefit robot visuo-motor policy generation is a promising direction that closely resembles human cognitive function and perception. Motivated by this, and further inspired by…

Robotics · Computer Science 2025-05-28 Nikos Giannakakis, Argyris Manetas, Panagiotis P. Filntisis, Petros Maragos, George Retsinas

Aligning Robot Representations with Humans

As robots are increasingly deployed in real-world scenarios, a key question is how to best transfer knowledge learned in one environment to another, where shifting constraints and human preferences render adaptation challenging. A central…

Human-Computer Interaction · Computer Science 2022-05-18 Andreea Bobu, Andi Peng

Task-Oriented Hierarchical Object Decomposition for Visuomotor Control

Good pre-trained visual representations could enable robots to learn visuomotor policy efficiently. Still, existing representations take a one-size-fits-all-tasks approach that comes with two important drawbacks: (1) Being completely…

Robotics · Computer Science 2024-11-05 Jianing Qian, Yunshuang Li, Bernadette Bucher, Dinesh Jayaraman

A Review on Robot Manipulation Methods in Human-Robot Interactions

Robot manipulation is an important part of human-robot interaction technology. However, traditional pre-programmed methods can only accomplish simple and repetitive tasks. To enable effective communication between robots and humans, and to…

Robotics · Computer Science 2023-09-12 Haoxu Zhang, Parham M. Kebria, Shady Mohamed, Samson Yu, Saeid Nahavandi

Universal Humanoid Motion Representations for Physics-Based Control

We present a universal motion representation that encompasses a comprehensive range of motor skills for physics-based humanoid control. Due to the high dimensionality of humanoids and the inherent difficulties in reinforcement learning,…

Computer Vision and Pattern Recognition · Computer Science 2024-04-15 Zhengyi Luo, Jinkun Cao, Josh Merel, Alexander Winkler, Jing Huang, Kris Kitani, Weipeng Xu

A Survey of Embodied Learning for Object-Centric Robotic Manipulation

Embodied learning for object-centric robotic manipulation is a rapidly developing and challenging area in embodied AI. It is crucial for advancing next-generation intelligent robots and has garnered significant interest recently. Unlike…

Robotics · Computer Science 2025-01-15 Ying Zheng, Lei Yao, Yuejiao Su, Yi Zhang, Yi Wang, Sicheng Zhao, Yiyi Zhang, Lap-Pui Chau

Object-Centric Representations Improve Policy Generalization in Robot Manipulation

Visual representations are central to the learning and generalization capabilities of robotic manipulation policies. While existing methods rely on global or dense features, such representations often entangle task-relevant and irrelevant…

Robotics · Computer Science 2025-05-20 Alexandre Chapin, Bruno Machado, Emmanuel Dellandrea, Liming Chen

Accelerating Interactive Human-like Manipulation Learning with GPU-based Simulation and High-quality Demonstrations

Dexterous manipulation with anthropomorphic robot hands remains a challenging problem in robotics because of the high-dimensional state and action spaces and complex contacts. Nevertheless, skillful closed-loop manipulation is required to…

Robotics · Computer Science 2022-12-06 Malte Mosbach, Kara Moraw, Sven Behnke

Aligning Robot and Human Representations

To act in the world, robots rely on a representation of salient task aspects: for example, to carry a coffee mug, a robot may consider movement efficiency or mug orientation in its behavior. However, if we want robots to act for and with…

Robotics · Computer Science 2024-01-30 Andreea Bobu, Andi Peng, Pulkit Agrawal, Julie Shah, Anca D. Dragan

Learning-based Cooperative Robotic Paper Wrapping: A Unified Control Policy with Residual Force Control

Human-robot cooperation is essential in environments such as warehouses and retail stores, where workers frequently handle deformable objects like paper, bags, and fabrics. Coordinating robotic actions with human assistance remains…

Robotics · Computer Science 2025-11-06 Rewida Ali, Cristian C. Beltran-Hernandez, Weiwei Wan, Kensuke Harada

R3M: A Universal Visual Representation for Robot Manipulation

We study how visual representations pre-trained on diverse human video data can enable data-efficient learning of downstream robotic manipulation tasks. Concretely, we pre-train a visual representation using the Ego4D human video dataset…

Robotics · Computer Science 2022-11-21 Suraj Nair, Aravind Rajeswaran, Vikash Kumar, Chelsea Finn, Abhinav Gupta

Making Sense of Vision and Touch: Learning Multimodal Representations for Contact-Rich Tasks

Contact-rich manipulation tasks in unstructured environments often require both haptic and visual feedback. It is non-trivial to manually design a robot controller that combines these modalities which have very different characteristics.…

Robotics · Computer Science 2019-07-31 Michelle A. Lee, Yuke Zhu, Peter Zachares, Matthew Tan, Krishnan Srinivasan, Silvio Savarese, Li Fei-Fei, Animesh Garg, Jeannette Bohg

Human-Humanoid Robots Cross-Embodiment Behavior-Skill Transfer Using Decomposed Adversarial Learning from Demonstration

Humanoid robots are envisioned as embodied intelligent agents capable of performing a wide range of human-level loco-manipulation tasks, particularly in scenarios requiring strenuous and repetitive labor. However, learning these skills is…

Robotics · Computer Science 2024-12-20 Junjia Liu, Zhuo Li, Minghao Yu, Zhipeng Dong, Sylvain Calinon, Darwin Caldwell, Fei Chen

Representation Matters: Improving Perception and Exploration for Robotics

Projecting high-dimensional environment observations into lower-dimensional structured representations can considerably improve data-efficiency for reinforcement learning in domains with limited data such as robotics. Can a single generally…

Machine Learning · Computer Science 2021-03-23 Markus Wulfmeier, Arunkumar Byravan, Tim Hertweck, Irina Higgins, Ankush Gupta, Tejas Kulkarni, Malcolm Reynolds, Denis Teplyashin, Roland Hafner, Thomas Lampe, Martin Riedmiller

Toward Artificial Palpation: Representation Learning of Touch on Soft Bodies

Palpation, the use of touch in medical examination, is almost exclusively performed by humans. We investigate a proof of concept for an artificial palpation method based on self-supervised learning. Our key idea is that an encoder-decoder…

Machine Learning · Computer Science 2025-11-21 Zohar Rimon, Elisei Shafer, Tal Tepper, Efrat Shimron, Aviv Tamar

H-InDex: Visual Reinforcement Learning with Hand-Informed Representations for Dexterous Manipulation

Human hands possess remarkable dexterity and have long served as a source of inspiration for robotic manipulation. In this work, we propose a human Hand-Informed visual representation learning framework to solve…

Machine Learning · Computer Science 2023-10-16 Yanjie Ze, Yuyao Liu, Ruizhe Shi, Jiaxin Qin, Zhecheng Yuan, Jiashun Wang, Huazhe Xu

Vision-based Robot Manipulation Learning via Human Demonstrations

Vision-based learning methods provide promise for robots to learn complex manipulation tasks. However, how to generalize the learned manipulation skills to real-world interactions remains an open question. In this work, we study robotic…

Robotics · Computer Science 2020-03-03 Zhixin Jia, Mengxiang Lin, Zhixin Chen, Shibo Jian

Learning Representations that Enable Generalization in Assistive Tasks

Recent work in sim2real has successfully enabled robots to act in physical environments by training in simulation with a diverse "population" of environments (i.e., domain randomization). In this work, we focus on enabling generalization…

Machine Learning · Computer Science 2022-12-07 Jerry Zhi-Yang He, Aditi Raghunathan, Daniel S. Brown, Zackory Erickson, Anca D. Dragan

Selective Visual Representations Improve Convergence and Generalization for Embodied AI

Embodied AI models often employ off-the-shelf vision backbones like CLIP to encode their visual observations. Although such general-purpose representations encode rich syntactic and semantic information about the scene, much of this…

Computer Vision and Pattern Recognition · Computer Science 2024-03-12 Ainaz Eftekhar, Kuo-Hao Zeng, Jiafei Duan, Ali Farhadi, Ani Kembhavi, Ranjay Krishna