机器人学 — Scifaro

VLA Models Are More Generalizable Than You Think: Revisiting Physical and Spatial Modeling

Vision-language-action (VLA) models achieve strong in-distribution performance but degrade sharply under novel camera viewpoints and visual perturbations. We show that this brittleness primarily arises from misalignment in Spatial Modeling,…

机器人学 · 计算机科学 2026-04-01 Weiqi Li , Quande Zhang , Ruifeng Zhai , Liang Lin , Guangrun Wang

Scaling Cross-Environment Failure Reasoning Data for Vision-Language Robotic Manipulation

Robust robotic manipulation requires reliable failure detection and recovery. Although recent Vision-Language Models (VLMs) show promise in robot failure detection, their generalization is severely limited by the scarcity and narrow…

机器人学 · 计算机科学 2026-04-01 Paul Pacaud , Ricardo Garcia , Shizhe Chen , Cordelia Schmid

Masked IRL: LLM-Guided Reward Disambiguation from Demonstrations and Language

Robots can adapt to user preferences by learning reward functions from demonstrations, but with limited data, reward models often overfit to spurious correlations and fail to generalize. This happens because demonstrations show robots how…

机器人学 · 计算机科学 2026-04-01 Minyoung Hwang , Alexandra Forsey-Smerek , Nathaniel Dennler , Andreea Bobu

Towards High-Consistency Embodied World Model with Multi-View Trajectory Videos

Embodied world models aim to predict and interact with the physical world through visual observations and actions. However, existing models struggle to accurately translate low-level actions (e.g., joint positions) into precise robotic…

机器人学 · 计算机科学 2026-04-01 Taiyi Su , Jian Zhu , Yaxuan Li , Chong Ma , Jianjun Zhang , Zitai Huang , Hanli Wang , Yi Xu

Stein-based Optimization of Sampling Distributions in Model Predictive Path Integral Control

This paper introduces a method for Model Predictive Path Integral (MPPI) control that optimizes sample generation towards an optimal trajectory through Stein Variational Gradient Descent (SVGD). MPPI relies upon predictive rollout of…

机器人学 · 计算机科学 2026-04-01 Jace Aldrich , Odest Chadwicke Jenkins

Interactive Force-Impedance Control

Human collaboration with robots requires flexible role adaptation, enabling the robot to switch between an active leader and a passive follower. Effective role switching depends on accurately estimating human intentions, which is typically…

机器人学 · 计算机科学 2026-04-01 Fan Shao , Satoshi Endo , Sandra Hirche , Fanny Ficuciello

MSG: Multi-Stream Generative Policies for Sample-Efficient Robotic Manipulation

Generative robot policies such as Flow Matching offer flexible, multi-modal policy learning but are sample-inefficient. Although object-centric policies improve sample efficiency, it does not resolve this limitation. In this work, we…

机器人学 · 计算机科学 2026-04-01 Jan Ole von Hartz , Lukas Schweizer , Joschka Boedecker , Abhinav Valada

DCReg: Decoupled Characterization for Efficient Degenerate LiDAR Registration

LiDAR point cloud registration is fundamental to robotic perception and navigation. In geometrically degenerate environments (e.g., corridors), registration becomes ill-conditioned: certain motion directions are weakly constrained, causing…

机器人学 · 计算机科学 2026-04-01 Xiangcheng Hu , Xieyuanli Chen , Mingkai Jia , Jin Wu , Ping Tan , Steven L. Waslander

UniLGL: Learning Uniform Place Recognition for FOV-limited/Panoramic LiDAR Global Localization

Existing LGL methods typically consider only partial information (e.g., geometric features) from LiDAR observations or are designed for homogeneous LiDAR sensors, overlooking the uniformity in LGL. In this work, a uniform LGL method is…

机器人学 · 计算机科学 2026-04-01 Hongming Shen , Xun Chen , Yulin Hui , Zhenyu Wu , Wei Wang , Qiyang Lyu , Tianchen Deng , Danwei Wang

Comparison of Localization Algorithms between Reduced-Scale and Real-Sized Vehicles Using Visual and Inertial Sensors

Physically reduced-scale vehicles are emerging to accelerate the development of advanced automated driving functions. In this paper, we investigate the effects of scaling on self-localization accuracy with visual and visual-inertial…

机器人学 · 计算机科学 2026-04-01 Tobias Kern , Leon Tolksdorf , Christian Birkner

Generation of Indoor Open Street Maps for Robot Navigation from CAD Files

The deployment of autonomous mobile robots is predicated on the availability of environmental maps, yet conventional generation via SLAM (Simultaneous Localization and Mapping) suffers from significant limitations in time, labor, and…

机器人学 · 计算机科学 2026-04-01 Jiajie Zhang , Shenrui Wu , Xu Ma , Sören Schwertfeger

Real-Time Operator Takeover for Visuomotor Diffusion Policy Training

We present a Real-Time Operator Takeover (RTOT) paradigm that enables operators to seamlessly take control of a live visuomotor diffusion policy, guiding the system back to desirable states or providing targeted corrective demonstrations.…

机器人学 · 计算机科学 2026-04-01 Marco Moletta , Michael C. Welle , Nils Ingelhag , Jesper Munkeby , Danica Kragic

Where to Look Next: Learning Viewpoint Recommendations for Informative Trajectory Planning

Search missions require motion planning and navigation methods for information gathering that continuously replan based on new observations of the robot's surroundings. Current methods for information gathering, such as Monte Carlo Tree…

机器人学 · 计算机科学 2026-04-01 Max Lodel , Bruno Brito , Álvaro Serra-Gómez , Laura Ferranti , Robert Babuška , Javier Alonso-Mora

Snake Robot Gait Decomposition and Gait Parameter Optimization

This paper proposes Gait Decomposition (G.D), a method of mathematically decomposing snake movements, and Gait Parameter Gradient (GPG), a method of optimizing decomposed gait parameters. G.D is a method that can express the snake gait…

机器人学 · 计算机科学 2026-04-01 Bongsub Song , Insung Ju , Dongwon Yun

FocusVLA: Focused Visual Utilization for Vision-Language-Action Models

Vision-Language-Action (VLA) models improve action generation by conditioning policies on rich vision-language information. However, current auto-regressive policies are constrained by three bottlenecks: (1) architectural bias drives models…

机器人学 · 计算机科学 2026-03-31 Yichi Zhang , Weihao Yuan , Yizhuo Zhang , Xidong Zhang , Jia Wan

Pandora: Articulated 3D Scene Graphs from Egocentric Vision

Robotic mapping systems typically approach building metric-semantic scene representations from the robot's own sensors and cameras. However, these "first person" maps inherit the robot's own limitations due to its embodiment or skillset,…

机器人学 · 计算机科学 2026-03-31 Alan Yu , Yun Chang , Christopher Xie , Luca Carlone

DRIVE-Nav: Directional Reasoning, Inspection, and Verification for Efficient Open-Vocabulary Navigation

Open-Vocabulary Object Navigation (OVON) requires an embodied agent to locate a language-specified target in unknown environments. Existing zero-shot methods often reason over dense frontier points under incomplete observations, causing…

机器人学 · 计算机科学 2026-03-31 Maoguo Gao , Zejun Zhu , Zhiming Sun , Zhengwei Ma , Longze Yuan , Zhongjing Ma , Zhigang Gao , Jinhui Zhang , Suli Zou

Vision-Based Robotic Disassembly Combined with Real-Time MFA Data Acquisition

Stable and reliable supplies of rare-Earth minerals and critical raw materials (CRMs) are essential for the development of the European Union. Since a large share of these materials enters the Union from outside, a valid option for CRMs…

机器人学 · 计算机科学 2026-03-31 Federico Zocco , Maria Pozzi , Monica Malvezzi

Serialized Red-Green-Gray: Quicker Heuristic Validation of Edges in Dynamic Roadmap Graphs

Motion planning in dynamic environments, such as robotic warehouses, requires fast adaptation to frequent changes in obstacle poses. Traditional roadmap-based methods struggle in such settings, relying on inefficient reconstruction of a…

机器人学 · 计算机科学 2026-03-31 Yulie Arad , Stav Ashur , Marta Markowicz , James D. Motes , Marco Morales , Nancy M. Amato

Dynamic Lookahead Distance via Reinforcement Learning-Based Pure Pursuit for Autonomous Racing

Pure Pursuit (PP) is a widely used path-tracking algorithm in autonomous vehicles due to its simplicity and real-time performance. However, its effectiveness is sensitive to the choice of lookahead distance: shorter values improve cornering…

机器人学 · 计算机科学 2026-03-31 Mohamed Elgouhary , Amr S. El-Wakeel