机器人学 — Scifaro

Predictive Spatio-Temporal Scene Graphs for Semi-Static Scenes

We have seen tremendous recent progress in our ability to build "spatio-semantic" representations that enable robots to perform complex reasoning across geometry and semantics. However, the vast majority of these methods lack any ability to…

机器人学 · 计算机科学 2026-05-04 Miguel Saavedra-Ruiz , Charlie Gauthier , Kumaraditya Gupta , Shima Shahfar , Kirsty Ellis , Steven Parkison , Liam Paull

World Model for Robot Learning: A Comprehensive Survey

World models, which are predictive representations of how environments evolve under actions, have become a central component of robot learning. They support policy learning, planning, simulation, evaluation, data generation, and have…

机器人学 · 计算机科学 2026-05-04 Bohan Hou , Gen Li , Jindou Jia , Tuo An , Xinying Guo , Sicong Leng , Haoran Geng , Yanjie Ze , Tatsuya Harada , Philip Torr , Oier Mees , Marc Pollefeys , Zhuang Liu , Jiajun Wu , Pieter Abbeel , Jitendra Malik , Yilun Du , Jianfei Yang

Being-H0.7: A Latent World-Action Model from Egocentric Videos

Visual-Language-Action models (VLAs) have advanced generalist robot control by mapping multimodal observations and language instructions directly to actions, but sparse action supervision often encourages shortcut mappings rather than…

机器人学 · 计算机科学 2026-05-04 Hao Luo , Wanpeng Zhang , Yicheng Feng , Sipeng Zheng , Haiweng Xu , Chaoyi Xu , Ziheng Xi , Yuhui Fu , Zongqing Lu

Do Open-Loop Metrics Predict Closed-Loop Driving? A Cross-Benchmark Correlation Study of NAVSIM and Bench2Drive

Open-loop evaluation offers fast, reproducible assessment of autonomous driving planners, but its ability to predict real closed-loop driving performance remains questionable. Prior work has shown that traditional open-loop metrics such as…

机器人学 · 计算机科学 2026-05-04 Yiru Wang , Anqing Jiang , Shuo Wang , Yuwen Heng , Hai Yang , Yang Chen , Hao Sun

Dynamic-TD3: A Novel Algorithm for UAV Path Planning with Dynamic Obstacle Trajectory Prediction

Deep reinforcement learning (DRL) finds extensive application in autonomous drone navigation within complex, high-risk environments. However, its practical deployment faces a safety-exploration dilemma: soft penalty mechanisms encourage…

机器人学 · 计算机科学 2026-05-04 Wentao Chen , Jingtang Chen , Mingjian Fu , Tiantian Li , Youfeng Su , Wenxi Liu , Yuanlong Yu

MotuBrain: An Advanced World Action Model for Robot Control

Vision-Language-Action (VLA) models generalize semantically well but often lack fine-grained modeling of world dynamics. We present MotuBrain, a unified World Action Model that jointly models video and action under a UniDiffuser formulation…

机器人学 · 计算机科学 2026-05-04 MotuBrain Team , Chendong Xiang , Fan Bao , Haitian Liu , Hengkai Tan , Hongzhe Bi , James Li , Jiabao Liu , Jingrui Pang , Kiro Jing , Louis Liu , Mengchen Cai , Rongxu Cui , Ruowen Zhao , Runqing Wang , Shuhe Huang , Yao Feng , Yinze Rong , Zeyuan Wang , Jun Zhu

STARRY: Spatial-Temporal Action-Centric World Modeling for Robotic Manipulation

Robotic manipulation requires reasoning about future spatial-temporal interactions and geometric constraints, yet existing Vision-Language-Action (VLA) policies often leave predictive representation weakly coupled with action execution,…

机器人学 · 计算机科学 2026-05-04 Yuxuan Tian , Yurun Jin , Bin Yu , Yukun Shi , Hao Wu , Chi Harold Liu , Kai Chen , Cong Huang

Sensitivity-Based Tube NMPC for Cooperative Aerial Structures Under Parametric Uncertainty

This paper presents a sensitivity-based tube Nonlinear Model Predictive Control (NMPC) framework for cooperative aerial chains under bounded parametric uncertainty. We consider a planar two-vehicle chain connected by rigid links, modeled…

机器人学 · 计算机科学 2026-05-04 Giuseppe Silano , Quentin Sablé , Marco Tognon , Luigi Iannelli , Antonio Franchi

Energy-Efficient Multi-Robot Coverage Path Planning of Non-Convex Regions of Interests

This letter presents an energy-efficient multi-robot coverage path planning (MRCPP) framework for large, nonconvex Regions of Interest (ROI) containing obstacles and no-fly zones (NFZ). Existing minimum-energy coverage planning algorithms…

机器人学 · 计算机科学 2026-05-04 Sourav Raxit , Jose Fuentes , Paulo Padrao , Abdullah Al Redwan Newaz , Md Tamjidul Hoque , Mark Kulp , Leonardo Bobadilla

Certifiable Factor Graph Optimization

We show that the factor graph and certifiable estimation paradigms, which have thus far been treated as essentially independent in the literature, can be naturally synthesized into a unified framework for certifiable factor graph…

机器人学 · 计算机科学 2026-05-04 Zhexin Xu , Nikolas R. Sanderson , Hanna Jiamei Zhang , David M. Rosen

Variable Elimination in Hybrid Factor Graphs for Discrete-Continuous Inference & Estimation

Many problems in robotics involve both continuous and discrete components, and modeling them together for estimation tasks has been a long standing and difficult problem. Hybrid Factor Graphs give us a mathematical framework to model these…

机器人学 · 计算机科学 2026-05-04 Varun Agrawal , Frank Dellaert

VLBiMan: Vision-Language Anchored One-Shot Demonstration Enables Generalizable Bimanual Robotic Manipulation

Achieving generalizable bimanual manipulation requires systems that can learn efficiently from minimal human input while adapting to real-world uncertainties and diverse embodiments. Existing approaches face a dilemma: imitation policy…

机器人学 · 计算机科学 2026-05-04 Huayi Zhou , Kui Jia

VLAs are Confined yet Capable of Generalizing to Novel Instructions

Vision-language-action models (VLAs) often achieve high performance on demonstrated tasks but struggle significantly when required to extrapolate, combining skills learned from different tasks in novel ways. For instance, VLAs might…

机器人学 · 计算机科学 2026-05-04 Quanyi Li

Causality-enhanced Decision-Making for Autonomous Mobile Robots in Dynamic Environments

The growing integration of robots in shared environments-such as warehouses, shopping centres, and hospitals-demands a deep understanding of the underlying dynamics and human behaviours, including how, when, and where individuals engage in…

机器人学 · 计算机科学 2026-05-04 Luca Castri , Gloria Beraldo , Nicola Bellotto

A Survey on Vision-Language-Action Models for Embodied AI

Embodied AI is widely recognized as a cornerstone of artificial general intelligence (AGI) because it involves controlling embodied agents to perform tasks in the physical world. Building on the success of large language models (LLMs) and…

机器人学 · 计算机科学 2026-05-04 Yueen Ma , Zixing Song , Yuzheng Zhuang , Jianye Hao , Irwin King

OmniRobotHome: A Multi-Camera Platform for Real-Time Multiadic Human-Robot Interaction

Human-robot collaboration has been studied primarily in dyadic or sequential settings. However, real homes require multiadic collaboration, where multiple humans and robots share a workspace, acting concurrently on interleaved subtasks with…

机器人学 · 计算机科学 2026-05-01 Junyoung Lee , Sookwan Han , Jeonghwan Kim , Inhee Lee , Mingi Choi , Jisoo Kim , Wonjung Woo , Hanbyul Joo

RopeDreamer: A Kinematic Recurrent State Space Model for Dynamics of Flexible Deformable Linear Objects

The robotic manipulation of Deformable Linear Objects (DLOs) is a fundamental challenge due to the high-dimensional, non-linear dynamics of flexible structures and the complexity of maintaining topological integrity during contact-rich…

机器人学 · 计算机科学 2026-05-01 Tim Missal , Lucas Domingues , Berk Guler , Simon Manschitz , Jan Peters , Paula Dornhofer Paro Costa

FlexiTac: A Low-Cost, Open-Source, Scalable Tactile Sensing Solution for Robotic Systems

We present FlexiTac, a low-cost, open-source, and scalable piezoresistive tactile sensing solution designed for robotic end-effectors. FlexiTac is a practical "plug-in" module consisting of (i) thin, flexible tactile sensor pads that…

机器人学 · 计算机科学 2026-05-01 Binghao Huang , Yunzhu Li

FreeOcc: Training-Free Embodied Open-Vocabulary Occupancy Prediction

Existing learning-based occupancy prediction methods rely on large-scale 3D annotations and generalize poorly across environments. We present FreeOcc, a training-free framework for open-vocabulary occupancy prediction from monocular or…

机器人学 · 计算机科学 2026-05-01 Zeyu Jiang , Changqing Zhou , Xingxing Zuo , Changhao Chen

Framework for Collaborative Operation of Autonomous Delivery Vehicles Within a Marshaling Yard

As autonomous vehicles slowly deploy into urban roads for limited use cases with significant edge case issues, closed facilities like marshaling yards provide a ripe case for combining lower-level vehicle autonomy with fixed infrastructure…

机器人学 · 计算机科学 2026-05-01 James O'Hara , Karl Wunderlich , Gregory Stevens