机器人学 — Scifaro

Guide, Think, Act: Interactive Embodied Reasoning in Vision-Language-Action Models

In this paper, we propose GTA-VLA(Guide, Think, Act), an interactive Vision-Language-Action (VLA) framework that enables spatially steerable embodied reasoning by allowing users to guide robot policies with explicit visual cues. Existing…

机器人学 · 计算机科学 2026-05-14 Yiran Ling , Qing Lian , Jinghang Li , Qing Jiang , Tianming Zhang , Xiaoke Jiang , Chuanxiu Liu , Jie Liu , Lei Zhang

Design of Magnetic Continuum Robots with Tunable Force Response Using Rotational Ring Pairs

In this paper, we discuss a novel continuum robot design that enables the online tuning of the magnetic response at its tip. The proposed method allows for the change of both effective magnetic direction and intensity, introducing steering…

机器人学 · 计算机科学 2026-05-14 Alex Sayres , Giovanni Pittiglio

Integration of an Agent Model into an Open Simulation Architecture for Scenario-Based Testing of Automated Vehicles

Simulative and scenario-based testing are crucial methods in the safety assurance for automated driving systems. To ensure that simulation results are reliable, the real world must be modeled with sufficient fidelity, including not only the…

机器人学 · 计算机科学 2026-05-14 Christian Geller , Daniel Becker , Jobst Beckmann , Lutz Eckstein

Uncertainty-Aware 3D Position Refinement for Multi-UAV Systems

Reliable real-time 3D localization is essential for multi-UAV navigation, collision avoidance, and coordinated flight, yet onboard estimates can degrade under GNSS multipath, non-line-of-sight reception, vertical drift, and intentional…

机器人学 · 计算机科学 2026-05-14 Hosam Alamleh , Damir Pulatov

CUBic: Coordinated Unified Bimanual Perception and Control Framework

Recent advances in visuomotor policy learning have enabled robots to perform control directly from visual inputs. Yet, extending such end-to-end learning from single-arm to bimanual manipulation remains challenging due to the need for both…

机器人学 · 计算机科学 2026-05-14 Xingyu Wang , Pengxiang Ding , Jingkai Xu , Donglin Wang , Zhaoxin Fan

Asymptotically Optimal Ergodic Coverage on Generalized Motion Fields

Autonomous robotic exploration in remote and extreme environments allows scientists to model complex transport phenomena and collective behaviors described by continuously deforming flow fields. Although these environments are naturally…

机器人学 · 计算机科学 2026-05-14 Christian Hughes , Yilang Liu , Yanis Lahrach , Julia Engdahl , Houston Warren , Darrick Lee , Fabio Ramos , Travis Miles , Ian Abraham

SID: Sliding into Distribution for Robust Few-Demonstration Manipulation

Generalizing robotic manipulation across object poses, viewpoints, and dynamic disturbances is difficult, especially with only a few demonstrations. End-to-end visuomotor policies are expressive but data-hungry, while planning and…

机器人学 · 计算机科学 2026-05-14 Yicheng Ma , Wei Yu , Zhian Su , Xidan Zhang , Huixu Dong

RotVLA: Rotational Latent Action for Vision-Language-Action Model

Latent Action Models (LAMs) have emerged as an effective paradigm for handling heterogeneous datasets during Vision-Language-Action (VLA) model pretraining, offering a unified action space across embodiments. However, existing LAMs often…

机器人学 · 计算机科学 2026-05-14 Qiwei Li , Xicheng Gong , Xinghang Li , Peiyan Li , Quanyun Zhou , Hangjun Ye , Jiahuan Zhou , Yadong Mu

BlockVLA: Accelerating Autoregressive VLA via Block Diffusion Finetuning

While autoregressive (AR) Vision-Language-Action (VLA) models have demonstrated formidable reasoning capabilities in robotic tasks, their sequential decoding process often incurs high inference latency and may amplify error accumulation…

机器人学 · 计算机科学 2026-05-14 Ruiheng Wang , Shuanghao Bai , Haoran Zhang , Badong Chen , Xiangyu Xu

Exploring Human-Robot Collaboration: Analysis of Interaction Modalities in Challenging Tasks

This work compares three interaction modalities for human-robot collaboration: passive, reactive, and proactive. We studied 18 participants assembling a seven-layer colored tower from memory while using nearby and distant blocks. In the…

机器人学 · 计算机科学 2026-05-14 Simone Arreghini , Cristina Iani , Alessandro Giusti , Valeria Villani , Lorenzo Sabattini , Antonio Paolillo

What Limits Vision-and-Language Navigation ?

Vision-and-Language Navigation (VLN) is a cornerstone of embodied intelligence. However, current agents often suffer from significant performance degradation when transitioning from simulation to real-world deployment, primarily due to…

机器人学 · 计算机科学 2026-05-14 Yunheng Wang , Yuetong Fang , Taowen Wang , Lusong Li , Kun Liu , Junzhe Xu , Zizhao Yuan , Yixiao Feng , Jiaxi Zhang , Wei Lu , Zecui Zeng , Renjing Xu

HCSG: Human-Centric Semantic-Geometric Reasoning for Vision-Language Navigation

VLN has achieved remarkable progress by scaling data and model capacity. However, the assumption of a static environment breaks down in real-world indoor scenarios, where robots inevitably encounter dynamic pedestrians. Existing human-aware…

机器人学 · 计算机科学 2026-05-14 Haoxuan Xu , Tianfu Li , Wenbo Chen , Yi Liu , Jin Wu , Huashuo Lei , Yunfan Lou , Lujia Wang , Hesheng Wang , Haoang Li

Galilean State Estimation for Inertial Navigation Systems with Unknown Time Delay

Many Inertial Navigation Systems (INS) use Global Navigation Satellite System (GNSS) position as the primary measurement to drive filter performance and bound error growth. However, commercial-grade GNSS receivers introduce unknown…

机器人学 · 计算机科学 2026-05-14 Giulio Delama , Martin Scheiber , Yixiao Ge , Tarek Hamel , Stephan Weiss , Robert Mahony

Calibration-Free Gas Source Localization with Mobile Robots: Source Term Estimation Based on Concentration Measurement Ranking

Efficient Gas Source Localization (GSL) in real-world settings is crucial, especially in emergency scenarios. Mobile robots equipped with low-cost, in-situ gas sensors offer a safer alternative to human inspection in hazardous environments.…

机器人学 · 计算机科学 2026-05-14 Wanting Jin , Agatha Duranceau , İzzet Kağan Erünsal , Alcherio Martinoli

Dynamics Computation of Soft-Rigid Hybrid-Link System and Its Application to Motion Analysis of an Athlete Wearing Sport Prosthesis

This paper presents a motion analysis framework for an athlete wearing sport-specific flexible prosthesis based on the soft-rigid hybrid-link system. Such a motion analysis is a challenging problem because we need to consider the…

机器人学 · 计算机科学 2026-05-14 Sunghee Kim , Yuta Shimane , Taiki Ishigaki , Ko Yamamoto

MoCCA: A Movable Circle Probability of Collision Approximation

In automated driving, crash mitigation is crucial to ensure passenger safety. Accurate avoidance requires precise knowledge of the object's position and orientation. However, sensor noise and occlusions often result in tracking and…

机器人学 · 计算机科学 2026-05-14 Tobias Kern , Christian Birkner

Multi-Depth Uniform Coverage Path Planning for Unmanned Surface Vehicle Surveying

This paper introduces a novel automatic coverage path planning algorithm for bathymetry surveying with unmanned surface vehicles. The detection range of the mapping sensor employed - a multibeam echo sounder - is heavily influenced by local…

机器人学 · 计算机科学 2026-05-14 Maider Larrazabal , Tong Yang , Izaro Goienetxea , Jaime Valls Miro

Towards Long-horizon Embodied Agents with Tool-Aligned Vision-Language-Action Models

Vision-language-action (VLA) models are effective robot action executors, but they remain limited on long-horizon tasks due to the dual burden of extended closed-loop planning and diverse physical operations. We therefore propose…

机器人学 · 计算机科学 2026-05-14 Zixing Lei , Changxing Liu , Yichen Xiong , Minhao Xiong , Yuanzhuo Ding , Zhipeng Zhang , Weixin Li , Siheng Chen

SECOND-Grasp: Semantic Contact-guided Dexterous Grasping

Achieving reliable robotic manipulation, such as dexterous grasping, requires a synergy between physically stable interactions and semantic task guidance, yet these objectives are often treated as separate, disjoint goals. In this paper, we…

机器人学 · 计算机科学 2026-05-14 Han Yi Shin , Heeju Ko , Jaewon Mun , Qixing Huang , Jaehyeok Lee , Sung June Kim , Honglak Lee , Sujin Jang , Sangpil Kim

What to Ignore, What to React: Visually Robust RL Fine-Tuning of VLA Models

Reinforcement learning (RL) fine-tuning has shown promise for Vision-Language-Action (VLA) models in robotic manipulation, but deployment-time visual shifts pose practical challenges. A key difficulty is that standard task rewards supervise…

机器人学 · 计算机科学 2026-05-14 Yuanfang Peng , Jingjing Fu , Chuheng Zhang , Li Zhao , Jiang Bian , Mingyu Liu , Ling Zhang , Jun Zhang , Rui Wang