机器人学 — Scifaro

Multi-Modal World Model for Physical Robot Interactions: Simultaneous Visual and Tactile Predictions for Enhanced Accuracy

Predicting the outcomes of robotic actions, often referred to as learning a world model, in complex environments remains a fundamental challenge in robotics. Existing approaches primarily rely on visual observations and action inputs to…

机器人学 · 计算机科学 2026-05-14 Willow Mandil , Amir Ghalamzan-E

INSANE: Cross-Domain UAV Data Sets with Increased Number of Sensors for developing Advanced and Novel Estimators

For real-world applications, autonomous mobile robotic platforms must be capable of navigating safely in a multitude of different and dynamic environments with accurate and robust localization being a key prerequisite. To support further…

机器人学 · 计算机科学 2026-05-14 Christian Brommer , Alessandro Fornasier , Martin Scheiber , Jeff Delaune , Roland Brockers , Jan Steinbrener , Stephan Weiss

Simulation-based multi-criteria comparison of mono-articular and bi-articular exoskeletons during walking with and without load

Developing exoskeletons that can reduce the metabolic cost of assisted subjects is challenging since a systematic design approach is required to capture the effects of device dynamics and the assistance torques on human performance. Design…

机器人学 · 计算机科学 2026-05-14 Ali KhalilianMotamed Bonab , Volkan Patoglu

Necessary and Sufficient Conditions for Passivity of Velocity-Sourced Impedance Control of Series Elastic Actuators

Series Elastic Actuation (SEA) has become prevalent in applications involving physical human-robot interaction as it provides considerable advantages over traditional stiff actuators in terms of stability robustness and fidelity of force…

机器人学 · 计算机科学 2026-05-14 Fatih Emre Tosun , Volkan Patoglu

SafeManip: A Property-Driven Benchmark for Temporal Safety Evaluation in Robotic Manipulation

Robotic manipulation is typically evaluated by task success, but successful completion does not guarantee safe execution. Many safety failures are temporal: a robot may touch a clean surface after contamination or release an object before…

机器人学 · 计算机科学 2026-05-13 Chengyue Huang , Khang Vo Huynh , Sebastian Elbaum , Zsolt Kira , Lu Feng

GuidedVLA: Specifying Task-Relevant Factors via Plug-and-Play Action Attention Specialization

Vision-Language-Action (VLA) models aim for general robot learning by aligning action as a modality within powerful Vision-Language Models (VLMs). Existing VLAs rely on end-to-end supervision to implicitly enable the action decoding process…

机器人学 · 计算机科学 2026-05-13 Xiaosong Jia , Bowen Yang , Zuhao Ge , Xian Nie , Yuchen Zhou , Cunxin Fan , Yufeng Li , Yilin Chai , Chao Jing , Zijian Liang , Qingwen Bu , Haidong Cao , Chao Wu , Qifeng Li , Zhenjie Yang , Chenhe Zhang , Hongyang Li , Zuxuan Wu , Junchi Yan , Yu-Gang Jiang

Real-Time Whole-Body Teleoperation of a Humanoid Robot Using IMU-Based Motion Capture with Sim2Sim and Sim2Real Validation

Stable, low-latency whole-body teleoperation of humanoid robots is an open research challenge, complicated by kinematic mismatches between human and robot morphologies, accumulated inertial sensor noise, non-trivial control latency, and…

机器人学 · 计算机科学 2026-05-13 Hamza Ahmed Durrani , Suleman Khan

SI-Diff: A Framework for Learning Search and High-Precision Insertion with a Force-Domain Diffusion Policy

Contact-rich assembly is fundamental in robotics but poses significant challenges due to uncertainties in relative poses, such as misalignments and small clearances in peg-in-hole tasks. Existing approaches typically address search and…

机器人学 · 计算机科学 2026-05-13 Yibo Liu , Stanko Oparnica , Simon Shewchun-Jakaitis , Guoyi Fu , Jie Wang , Jun Yang , Anand Jagannathan , Tony Hong-Yau Lo

TMRL: Diffusion Timestep-Modulated Pretraining Enables Exploration for Efficient Policy Finetuning

Fine-tuning pre-trained robot policies with reinforcement learning (RL) often inherits the bottlenecks introduced by pre-training with behavioral cloning (BC), which produces narrow action distributions that lack the coverage necessary for…

机器人学 · 计算机科学 2026-05-13 Matthew M. Hong , Jesse Zhang , Anusha Nagabandi , Abhishek Gupta

Morphologically Equivariant Flow Matching for Bimanual Mobile Manipulation

Mobile manipulation requires coordinated control of high-dimensional, bimanual robots. Imitation learning methods have been broadly used to solve these robotic tasks, yet typically ignore the bilateral morphological symmetry inherent in…

机器人学 · 计算机科学 2026-05-13 Max Siebenborn , Daniel Ordoñez Apraez , Sophie Lueth , Giulio Turrisi , Massimiliano Pontil , Claudio Semini , Georgia Chalvatzaki

DexTwist: Dexterous Hand Retargeting for Twist Motion via Mixed Reality-based Teleoperation

Dexterous teleoperation via Mixed Reality (MR)-based interfaces offers a scalable paradigm for transferring human manipulation skills to dexterous robot hands. However, conventional retargeting approaches that minimize kinematic…

机器人学 · 计算机科学 2026-05-13 Dongmyoung Lee , Chengxi Li , Dongheui Lee

From Imagined Futures to Executable Actions: Mixture of Latent Actions for Robot Manipulation

Video generation models offer a promising imagination mechanism for robot manipulation by predicting long-horizon future observations, but effectively exploiting these imagined futures for action execution remains challenging. Existing…

机器人学 · 计算机科学 2026-05-13 Yajie Li , Bozhou Zhang , Chun Gu , Zipei Ma , Jiahui Zhang , Jiankang Deng , Xiatian Zhu , Li Zhang

X-Imitator: Spatial-Aware Imitation Learning via Bidirectional Action-Pose Interaction

Effectively handling the interplay between spatial perception and action generation remains a critical bottleneck in robotic manipulation. Existing methods typically treat spatial perception and action execution as decoupled or strictly…

机器人学 · 计算机科学 2026-05-13 Kai Xiong , Hongjie Fang , Lixin Yang , Cewu Lu

Premover: Fast Vision-Language-Action Control by Acting Before Instructions Are Complete

Vision-Language-Action (VLA) policies are typically evaluated as if the user had finished typing or speaking before the robot begins acting. In real deployment, however, users take several seconds to enter a request, leaving the policy idle…

机器人学 · 计算机科学 2026-05-13 Joonha Park , Jiseung Jeong , Taesik Gong

World Action Models: The Next Frontier in Embodied AI

Vision-Language-Action (VLA) models have achieved strong semantic generalization for embodied policy learning, yet they learn reactive observation-to-action mappings without explicitly modeling how the physical world evolves under…

机器人学 · 计算机科学 2026-05-13 Siyin Wang , Junhao Shi , Zhaoyang Fu , Xinzhe He , Feihong Liu , Chenchen Yang , Yikang Zhou , Zhaoye Fei , Jingjing Gong , Jinlan Fu , Mike Zheng Shou , Xuanjing Huang , Xipeng Qiu , Yu-Gang Jiang

Learning What Matters: Adaptive Information-Theoretic Objectives for Robot Exploration

Designing learnable information-theoretic objectives for robot exploration remains challenging. Such objectives aim to guide exploration toward data that reduces uncertainty in model parameters, yet it is often unclear what information the…

机器人学 · 计算机科学 2026-05-13 Youwei Yu , Jionghao Wang , Zhengming Yu , Wenping Wang , Lantao Liu

Control of Fully Actuated Aerial Vehicles: A Comparison of Model-based and Sensor-based Dynamic Inversion

Fully actuated multirotor platforms decouple translational force generation from vehicle attitude, enabling independent control of position and orientation and shifting performance limitations from attitude authority to actuator dynamics…

机器人学 · 计算机科学 2026-05-13 Ali Sidar Yilmaz , Buday Turan , Lukas Pries , Markus Ryll

Closing the Motion Execution Gap: From Semantic Motion Task Constraints to Kinematic Control

This paper addresses the Motion Execution Gap, the disconnect between high-level symbolic task descriptions using semantic constraints and executable robot motions. Motion Statecharts are introduced as an executable symbolic representation…

机器人学 · 计算机科学 2026-05-13 Simon Stelter , Vanessa Hassouna , Malte Huerkamp , Michael Beetz

Cooperative Robotics Reinforced by Collective Perception for Traffic Moderation

Collisions at non-line-of-sight (NLOS) intersections remain a major safety concern because drivers have limited visibility of approaching traffic. V2X based warnings can reduce these risks, yet many vehicles are not equipped with V2X and…

机器人学 · 计算机科学 2026-05-13 Mohammad Khoshkdahan , John Pravin Arockiasamy , Andy Flores Comeca , Alexey Vinel

From Reaction to Anticipation: Proactive Failure Recovery through Agentic Task Graph for Robotic Manipulation

Although robotic manipulation has made significant progress, reliable execution remains challenging because task failures are inevitable in dynamic and unstructured environments. To handle such failures, existing frameworks typically follow…

机器人学 · 计算机科学 2026-05-13 Sheng Xu , Ruixing Jin , Huayi Zhou , Bo Yue , Guanren Qiao , Yunxin Tai , Yueci Deng , Kui Jia , Guiliang Liu