Robotics - Scifaro

RACAS: Controlling Diverse Robots With a Single Agentic System

Many robotic platforms expose an API through which external software can command their actuators and read their sensors. However, transitioning from these low-level interfaces to high-level autonomous behaviour requires a complicated…

Robotics · Computer Science 2026-03-12 Dylan R. Ashley, Jan Przepióra, Yimeng Chen, Ali Abualsaud, Nurzhan Yesmagambet, Shinkyu Park, Eric Feron, Jürgen Schmidhuber

Dull, Dirty, Dangerous: Understanding the Past, Present, and Future of a Key Motivation for Robotics

In robotics, the concept of "dull, dirty, and dangerous" (DDD) work has been used to motivate where robots might be useful. In this paper, we conduct an empirical analysis of robotics publications between 1980 and 2024 that mention DDD, and…

Robotics · Computer Science 2026-03-12 Nozomi Nakajima, Pedro Reynolds-Cuéllar, Caitrin Lynch, Kate Darling

Moving On, Even When You're Broken: Fail-Active Trajectory Generation via Diffusion Policies Conditioned on Embodiment and Task

Robot failure is detrimental and disruptive, often requiring human intervention to recover. Our vision is 'fail-active' operation, allowing robots to safely complete their tasks even when damaged. Focusing on 'actuation failures', we…

Robotics · Computer Science 2026-03-12 Gilberto G. Briscoe-Martinez, Yaashia Gautam, Rahul Shetty, Anuj Pasricha, Marco M. Nicotra, Alessandro Roncone

Cosmos-H-Surgical: Learning Surgical Robot Policies from Videos via World Modeling

Data scarcity remains a fundamental barrier to achieving fully autonomous surgical robots. While large scale vision language action (VLA) models have shown impressive generalization in household and industrial manipulation by leveraging…

Robotics · Computer Science 2026-03-12 Yufan He, Pengfei Guo, Mengya Xu, Zhaoshuo Li, Andriy Myronenko, Dillan Imans, Bingjie Liu, Dongren Yang, Mingxue Gu, Yongnan Ji, Yueming Jin, Ren Zhao, Baiyong Shen, Daguang Xu

Global End-Effector Pose Control of an Underactuated Aerial Manipulator via Reinforcement Learning

Aerial manipulators, which combine robotic arms with multi-rotor drones, face strict constraints on arm weight and mechanical complexity. In this work, we study a lightweight 2-degree-of-freedom (DoF) arm mounted on a quadrotor via a…

Robotics · Computer Science 2026-03-12 Shlok Deshmukh, Javier Alonso-Mora, Sihao Sun

PvP: Data-Efficient Humanoid Robot Learning with Proprioceptive-Privileged Contrastive Representations

Achieving efficient and robust whole-body control (WBC) is essential for enabling humanoid robots to perform complex tasks in dynamic environments. Despite the success of reinforcement learning (RL) in this domain, its sample inefficiency…

Robotics · Computer Science 2026-03-12 Mingqi Yuan, Tao Yu, Haolin Song, Bo Li, Xin Jin, Hua Chen, Wenjun Zeng

MergeVLA: Cross-Skill Model Merging Toward a Generalist Vision-Language-Action Agent

Recent Vision-Language-Action (VLA) models reformulate vision-language models by tuning them with millions of robotic demonstrations. While they perform well when fine-tuned for a single embodiment or task family, extending them to…

Robotics · Computer Science 2026-03-12 Yuxia Fu, Zhizhen Zhang, Yuqi Zhang, Zijian Wang, Zi Huang, Yadan Luo

Safe and Optimal Learning from Preferences via Weighted Temporal Logic with Applications in Robotics and Formula 1

Autonomous systems increasingly rely on human feedback to align their behavior, expressed as pairwise comparisons, rankings, or demonstrations. While existing methods can adapt behaviors, they often fail to guarantee safety in…

Robotics · Computer Science 2026-03-12 Ruya Karagulle, Cristian-Ioan Vasile, Necmiye Ozay

Symskill: Symbol and Skill Co-Invention for Data-Efficient and Reactive Long-Horizon Manipulation

Multi-step manipulation in dynamic environments remains challenging. Imitation learning (IL) is reactive but lacks compositional generalization, since monolithic policies do not decide which skill to reuse when scenes change. Classical…

Robotics · Computer Science 2026-03-12 Yifei Simon Shao, Yuchen Zheng, Sunan Sun, Pratik Chaudhari, Vijay Kumar, Nadia Figueroa

Self-Improving Loops for Visual Robotic Planning

Video generative models trained on expert demonstrations have been utilized as performant text-conditioned visual planners for solving robotic tasks. However, generalization to unseen tasks remains a challenge. Whereas improved…

Robotics · Computer Science 2026-03-12 Calvin Luo, Zilai Zeng, Mingxi Jia, Yilun Du, Chen Sun

A Chain-Driven, Sandwich-Legged Quadruped Robot: Design and Experimental Analysis

This paper introduces a chain-driven, sandwich-legged mid-size quadruped robot designed as an accessible research platform. The design prioritizes enhanced locomotion, improved actuation reliability and safety, and simplified,…

Robotics · Computer Science 2026-03-12 Aman Singh, Bhavya Giri Goswami, Ketan Nehete, Shishir N. Y. Kolathaya

vS-Graphs: Tightly Coupling Visual SLAM and 3D Scene Graphs Exploiting Hierarchical Scene Understanding

Current Visual Simultaneous Localization and Mapping (VSLAM) systems often struggle to create maps that are both semantically rich and easily interpretable. While incorporating semantic scene knowledge aids in building richer maps with…

Robotics · Computer Science 2026-03-12 Ali Tourani, Saad Ejaz, Hriday Bavle, Miguel Fernandez-Cortizas, David Morilla-Cabello, Jose Luis Sanchez-Lopez, Holger Voos

Open-World Task and Motion Planning via Vision-Language Model Generated Constraints

Foundation models like Vision-Language Models (VLMs) excel at common sense vision and language tasks such as visual question answering. However, they cannot yet directly solve complex, long-horizon robot manipulation problems requiring…

Robotics · Computer Science 2026-03-12 Nishanth Kumar, William Shen, Fabio Ramos, Dieter Fox, Tomás Lozano-Pérez, Leslie Pack Kaelbling, Caelan Reed Garrett

Automated Layout and Control Co-Design of Robust Multi-UAV Transportation Systems

The joint optimization of physical parameters and controllers in robotic systems is challenging. This is due to the difficulties of predicting the effect that changes in physical parameters have on final performances. At the same time,…

Robotics · Computer Science 2026-03-12 Carlo Bosio, Mark W. Mueller

TiPToP: A Modular Open-Vocabulary Planning System for Robotic Manipulation

We present TiPToP, an extensible modular system that combines pretrained vision foundation models with an existing Task and Motion Planner (TAMP) to solve multi-step manipulation tasks directly from input RGB images and natural-language…

Robotics · Computer Science 2026-03-11 William Shen, Nishanth Kumar, Sahit Chintalapudi, Jie Wang, Christopher Watson, Edward Hu, Jing Cao, Dinesh Jayaraman, Leslie Pack Kaelbling, Tomás Lozano-Pérez

BEACON: Language-Conditioned Navigation Affordance Prediction under Occlusion

Language-conditioned local navigation requires a robot to infer a nearby traversable target location from its current observation and an open-vocabulary, relational instruction. Existing vision-language spatial grounding methods usually…

Robotics · Computer Science 2026-03-11 Xinyu Gao, Gang Chen, Javier Alonso-Mora

Kinodynamic Motion Retargeting for Humanoid Locomotion via Multi-Contact Whole-Body Trajectory Optimization

We present the KinoDynamic Motion Retargeting (KDMR) framework, a novel approach for humanoid locomotion that models the retargeting process as a multi-contact, whole-body trajectory optimization problem. Conventional kinematics-based…

Robotics · Computer Science 2026-03-11 Xiaoyu Zhang, Steven Haener, Varun Madabushi, Maegan Tucker

NanoBench: A Multi-Task Benchmark Dataset for Nano-Quadrotor System Identification, Control, and State Estimation

Existing aerial-robotics benchmarks target vehicles from hundreds of grams to several kilograms and typically expose only high-level state data. They omit the actuator-level signals required to study nano-scale quadrotors, where…

Robotics · Computer Science 2026-03-11 Syed Izzat Ullah, Jose Baca

TIMID: Time-Dependent Mistake Detection in Videos of Robot Executions

As robotic systems execute increasingly difficult task sequences, the number of ways in which they can fail grows as well. Video Anomaly Detection (VAD) frameworks typically focus on singular, low-level kinematic or action failures, struggling to…

Robotics · Computer Science 2026-03-11 Nerea Gallego, Fernando Salanova, Claudio Mannarano, Cristian Mahulea, Eduardo Montijano

MuxGel: Simultaneous Dual-Modal Visuo-Tactile Sensing via Spatially Multiplexing and Deep Reconstruction

High-fidelity visuo-tactile sensing is important for precise robotic manipulation. However, most vision-based tactile sensors face a fundamental trade-off: opaque coatings enable tactile sensing but block pre-contact vision. To address…

Robotics · Computer Science 2026-03-11 Zhixian Hu, Zhengtong Xu, Sheeraz Athar, Juan Wachs, Yu She