机器人学 — Scifaro

Diffusion Policy for Coordinated Control of a Nonholonomic Mobile Base and Dual Arms in Door Opening and Passing

Opening heavy, self closing doors, especially those that require pulling remains a long standing challenge in robotics. Humans naturally employ both arms in a dexterous manner, rotating the handle, widening the gap, holding the door,…

机器人学 · 计算机科学 2026-05-18 Shangqun Yu , Matthew En , Daniel Wu , Sangjun Park , Ziyi Zhou , Seyed Fakoorian , Donghyun Kim

PhysBrain 1.0 Technical Report

Vision-language-action models have advanced rapidly, but robot trajectories alone provide limited coverage for learning broad physical understanding. PhysBrain 1.0 studies a complementary route: converting large-scale human egocentric video…

机器人学 · 计算机科学 2026-05-18 Shijie Lian , Bin Yu , Xiaopeng Lin , Changti Wu , Hang Yuan , Xiaolin Hu , Zhaolong Shen , Yuzhuo Miao , Haishan Liu , Yuxuan Tian , Yukun Shi , Cong Huang , Kai Chen

CLOVER: Closed-Loop Value Estimation and Ranking for End-to-End Autonomous Driving Planning

End-to-end autonomous driving planners are commonly trained by imitating a single logged trajectory, yet evaluated by rule-based planning metrics that measure safety, feasibility, progress, and comfort. This creates a training--evaluation…

机器人学 · 计算机科学 2026-05-18 Sining Ang , Yuguang Yang , Canyu Chen , Yan Wang

Towards Robotic Dexterous Hand Intelligence: A Survey

Robotic dexterous hands are central to contact-rich manipulation, with rapid progress driven by advances in hardware, sensing, control, simulation, and data generation. However, existing studies are often developed under different…

机器人学 · 计算机科学 2026-05-18 Weiguang Zhao , Tian Liang , Xihao Guo , Rui Zhang , Irwin King , Kaizhu Huang

Coordinated Diffusion: Generating Multi-Agent Behavior Without Multi-Agent Demonstrations

Imitation learning powered by generative models has proven effective for modeling complex single-agent behaviors. However, teaching multi-agent systems, like multiple arms or vehicles, to coordinate through imitation learning is hindered by…

机器人学 · 计算机科学 2026-05-18 Lasse Peters , Laura Ferranti , Andrea Bajcsy , Javier Alonso-Mora

ConsistNav: Closing the Action Consistency Gap in Zero-Shot Object Navigation with Semantic Executive Control

Zero-shot object navigation has advanced rapidly with open-vocabulary detectors, image--text models, and language-guided exploration. However, even after current methods detect a plausible target hypothesis, the agent may still oscillate…

机器人学 · 计算机科学 2026-05-18 Haosen Wang , Zhenyang Li , Yinqiang Zhang , Zongqi He , Lutao Jiang , Kai Li , Yizhou Zhao , Liaoyuan Fan , Wenjian Hou , Tingbang Liang , Yibin Wen , Defeng Gu

GSDrive: Reinforcing Driving Policies by Multi-mode Future Trajectory Probing with 3D Gaussian Splatting Environment

End-to-end (E2E) autonomous driving aims to directly map sensory observations to driving actions, but its real-world deployment is hindered by evolving data distributions and the high cost of continual annotation. While combining imitation…

机器人学 · 计算机科学 2026-05-18 Ziang Guo , Chen Min , Xuefeng Zhang , Yixiao Zhou , Shuo Wang , Sifa Zheng , Dzmitry Tsetserukou , Zufeng Zhang

Vision-Based Safe Human-Robot Collaboration with Uncertainty Guarantees

We propose a framework for vision-based human pose estimation and motion prediction that gives conformal prediction guarantees for certifiably safe human-robot collaboration. Our framework combines aleatoric uncertainty estimation with OOD…

机器人学 · 计算机科学 2026-05-18 Jakob Thumm , Marian Frei , Tianle Ni , Matthias Althoff , Marco Pavone

A Hierarchical Spatiotemporal Action Tokenizer for In-Context Imitation Learning in Robotics

We present a novel hierarchical spatiotemporal action tokenizer for in-context imitation learning. We first propose a hierarchical approach, which consists of two successive levels of vector quantization. In particular, the lower level…

机器人学 · 计算机科学 2026-05-18 Fawad Javed Fateh , Ali Shah Ali , Murad Popattia , Usman Nizamani , Andrey Konin , M. Zeeshan Zia , Quoc-Huy Tran

frax: Fast Robot Kinematics and Dynamics in JAX

In robot control, planning, and learning, there is a need for rigid-body dynamics libraries that are highly performant, easy to use, and compatible with CPUs and accelerators. While existing libraries often excel at either low-latency CPU…

机器人学 · 计算机科学 2026-05-18 Daniel Morton , Marco Pavone

Learning Structured Robot Policies from Vision-Language Models via Synthetic Neuro-Symbolic Supervision

Vision-Language Models (VLMs) have recently demonstrated strong capabilities in mapping multimodal observations to robot behaviors. However, most current approaches rely on end-to-end visuomotor policies that remain opaque and difficult to…

机器人学 · 计算机科学 2026-05-18 Alessandro Adami , Tommaso Tubaldo , Marco Todescato , Ruggero Carli , Pietro Falco

OpenFrontier: General Navigation with Visual-Language Grounded Frontiers

Open-world navigation requires robots to make decisions in complex everyday environments while adapting to flexible task requirements. Conventional navigation approaches often rely on dense 3D reconstruction and hand-crafted goal metrics,…

机器人学 · 计算机科学 2026-05-18 Esteban Padilla-Cerdio , Boyang Sun , Marc Pollefeys , Hermann Blum

HoMMI: Learning Whole-Body Mobile Manipulation from Human Demonstrations

We present Whole-Body Mobile Manipulation Interface (HoMMI), a data collection and policy learning framework that learns whole-body mobile manipulation directly from robot-free human demonstrations. We augment UMI interfaces with egocentric…

机器人学 · 计算机科学 2026-05-18 Xiaomeng Xu , Jisang Park , Han Zhang , Eric Cousineau , Aditya Bhat , Jose Barreiros , Dian Wang , Jeannette Bohg , Shuran Song

The OncoReach Stylet for Brachytherapy: Design Evaluation and Pilot Study

Cervical cancer accounts for a significant portion of the global cancer burden among women. Interstitial brachytherapy (ISBT) is a standard procedure for treating cervical cancer; it involves placing a radioactive source through a straight…

机器人学 · 计算机科学 2026-05-18 Pejman Kheradmand , Kent K. Yamamoto , Emma Webster , Keith Sowards , Gianna Hatheway , Katharine L. Jackson , Sabino Zani , Julie A. Raffi , Diandra N. Ayala-Peacock , Scott R. Silva , Joanna Deaton Bertram , Yash Chitalia

Sparse ActionGen: Accelerating Diffusion Policy with Real-time Pruning

Diffusion Policy has dominated action generation due to its strong capabilities for modeling multi-modal action distributions, but its multi-step denoising processes make it impractical for real-time visuomotor control. Existing…

机器人学 · 计算机科学 2026-05-18 Kangye Ji , Jianbo Zhou , Yuan Meng , Ye Li , Hanyun Cui , Zhi Wang

CLARE: Continual Learning for Vision-Language-Action Models via Autonomous Adapter Routing and Expansion

To teach robots complex manipulation tasks, a common approach is to fine-tune a pre-trained vision-language-action model (VLA) on task-specific data. However, since this recipe updates existing representations, it is unsuitable for…

机器人学 · 计算机科学 2026-05-18 Ralf Römer , Yi Zhang , Yuming Li , Angela P. Schoellig

An Introduction to Deep Reinforcement and Imitation Learning

Embodied agents, such as robots and virtual characters, must continuously select actions to execute tasks effectively, solving complex sequential decision-making problems. Given the difficulty of designing such controllers manually,…

机器人学 · 计算机科学 2026-05-18 Pedro Santana

Empowering Robot Teleoperation: Exploring the Synergies Between Devices and Manipulator Controllers in a Comparative Study

Robot learning empowers the robot system with human brain-like intelligence to autonomously acquire and adapt skills through experience, enhancing flexibility and adaptability in various environments. Aimed at achieving a similar level of…

机器人学 · 计算机科学 2026-05-18 Yuxuan Zhao , Yuanchen Tang , Jindi Zhang , Hongyu Yu

Whole-body motion planning and safety-critical control for aerial manipulation

Aerial manipulation combines the maneuverability of multirotors with the dexterity of robotic arms to perform complex tasks in cluttered spaces. Yet planning safe, dynamically feasible trajectories remains difficult due to whole-body…

机器人学 · 计算机科学 2026-05-18 Lin Yang , Jinwoo Lee , Domenico Campolo , H. Jin Kim , Jeonghyun Byun

Flatness-based trajectory planning for 3D overhead cranes with friction compensation and collision avoidance

This paper presents an optimal trajectory generation method for 3D overhead cranes by leveraging differential flatness. This framework enables the direct inclusion of complex physical and dynamic constraints, such as nonlinear friction and…

机器人学 · 计算机科学 2026-05-18 Jorge Vicente-Martinez , Edgar Ramirez-Laboreo