机器人学 — Scifaro

Tactile-based Multimodal Fusion in Embodied Intelligence: A Survey of Vision, Language, and Contact-Driven Paradigms

Tactile sensing is a fundamental modality for embodied intelligence, offering unique and direct feedback on contact geometry, material properties, and interaction dynamics that remote sensors cannot replace. However, unimodal tactile…

机器人学 · 计算机科学 2026-05-19 Zhixiang Cao , Di Tian , Runwei Guan , Yanzhou Mu , Xiaolou Sun , Shaofeng Liang , Daizong Liu , Tao Huang , Yutao Yue , Henghui Ding , Bin Fang , Alex Zhou , Qing-Long Han , Hui Xiong

Efficient Feature-Free Initialization for Monocular Visual-Inertial Systems Using a Feed-Forward 3D Model

Fast and reliable initialization is critical for monocular visual-inertial navigation systems (VINS), as it establishes the starting conditions for subsequent state estimation. Despite steady progress, most existing methods heavily rely on…

机器人学 · 计算机科学 2026-05-19 Yuantai Zhang , Jiaqi Yang , Huajian Zeng , Changhao Chen , Haoang Li , Liang Li , Dezhen Song , Xingxing Zuo

Beyond Geometry: Efficient Topologically-Grounded Navigation in Complex 3D Environments

Ground robot navigation in complex 3D environments is often hindered by geometric ambiguity, where non-traversable structures such as furniture share local geometric properties with navigable ground. Furthermore, the computational cost of…

机器人学 · 计算机科学 2026-05-19 Yifan Du , Chengwei Zhang , Siyu Liao , Zhongfeng Wang

HCLM: A Hierarchical Framework for Cooperative Loco-Manipulation with Dual Quadrupeds

We introduce HCLM, a hierarchical framework for general-purpose cooperative loco-manipulation with dual quadrupedal systems. Coordinating multi-robot collaborative manipulation across floating bases is highly challenging due to the…

机器人学 · 计算机科学 2026-05-19 Qixuan Li , Chen Le , Jincheng Yu , Xinlei Chen

Task Capability Improvement Algorithm for Collaborative Manipulators

This work introduces a cooperative task capability improvement utilizing additional moments. The manipulators apply forces at the object's grasp point. Applying forces at a point other than the object's center of gravity produces undesired…

机器人学 · 计算机科学 2026-05-19 Keshab Patra , Arpita Sinha , Anirban Guha

Stretch-ICP: A Continuous-Trajectory Registration and Deskewing Algorithm in Scenarios of Aggressive Motions

Robust robotic autonomy remains challenging in complex environments, where loss of stability on uneven or slippery terrain can induce extreme accelerations and angular velocities. Such motions corrupt sensor measurements and degrade state…

机器人学 · 计算机科学 2026-05-19 Simon-Pierre Deschênes , Veronica Vannini , Philippe Giguère , François Pomerleau

SEDualVLN: A Spatially-Enhanced Dual-System for Vision-Language Navigation

Vision-Language Navigation (VLN) approaches have currently followed two primary paradigms: the end-to-end Vision-Language Model (VLM) policy fine-tuned on navigation trajectories to directly predict actions, and the zero-shot modular…

机器人学 · 计算机科学 2026-05-19 Jingzhi Huang , Junkai Huang , Wenxuan Song , Haoyang Yang , Hailong Huang , Haoang Li , Yi Wang

Generating Realistic Safety-Critical Scenarios for Vehicle-Pedestrian Interactions

Automated driving system deployment requires rigorous validation across safety-critical vehicle-pedestrian interactions, yet real-world datasets rarely capture high-risk scenarios while simulation platforms lack realistic behavior. In…

机器人学 · 计算机科学 2026-05-19 Qingwen Pu , Kun Xie , Yuan Zhu , Guocong Zhai

Event-Grounded Sparse Autoencoders for Vision-Language-Action Policies

Vision-Language-Action (VLA) policies translate language and visual inputs into robot actions, where their hidden representations directly shape closed-loop behavior. However, mechanistic interpretability tools from language and…

机器人学 · 计算机科学 2026-05-19 Xinchen Jin , Aditya Chatterjee , Pranav Kumar , Rohan Paleja

Contrastive Conceptor Activation Steering (COAST): Unlocking Vision-Language-Action Models through Hidden States

Vision-Language-Action (VLA) models leverage powerful perceptual priors from web-scale Vision-Language Model (VLM) pre-training, yet they remain surprisingly brittle in practice, frequently failing at simple robotic tasks. To mitigate this,…

机器人学 · 计算机科学 2026-05-19 Miranda Muqing Miao , Subin Kim , Brandon Yang , Lyle Ungar

How to Instruct Your Robot: Dense Language Annotations Power Robot Policy Learning

Scaling robot policy learning is bottlenecked by the cost of collecting demonstrations, while language annotations for existing demonstrations are comparatively cheap. We study language density as a lever for extracting more signal from a…

机器人学 · 计算机科学 2026-05-19 Bosung Kim , Ruiyi Wang , David Acuna , Jaehun Jung , Alexander Trevithick , Brandon Cui , Yejin Choi , Prithviraj Ammanabrolu

Generalizable and Actionable Parts Pose Estimation with Symmetry Annotation-Free Learning Strategy

Urgently needed generalizable robot object interaction and manipulation requires high-quality Cross-Category object perception. As a pioneer of this area, Generalizable and Actionable Parts (GAParts) understanding has attracted increasing…

机器人学 · 计算机科学 2026-05-19 Wenxiao Chen , Xueyu Yuan , Liu Liu , Di Wu , Dan Guo

NORM-Nav: Zero-Shot Mobile Robot Navigation with Natural Language Behavioral Constraints

Mobile robots operating in human-centered environments must generate not only collision-free paths but also trajectories that follow local behavioral conventions. Conventional costmap-based navigation emphasizes geometric feasibility and…

机器人学 · 计算机科学 2026-05-19 Dongjie Huo , Junhui Wang , Chao Gao , Yan Qiao , Dong Zhang , Guyue Zhou

MORN: Metacognitive Object-Goal Regulation for Resource-Rational Long-Horizon Navigation

Robots deployed in unstructured human environments must frequently execute long-horizon missions, such as find the mug, then the chair, then the printer, under strict operational constraints. While contemporary zero-shot Object Navigation…

机器人学 · 计算机科学 2026-05-19 Xi Lin , Jiayi Li , Kangyi Wu , Jiaqiao Tang , Qingrong He , Lin Zhao

Beyond Safety Filtering: Control Barrier Function-Informed Reinforcement Learning for Connected and Automated Vehicles

Reinforcement Learning (RL) uses rewards to guide learning, yet reward design is typically hand-crafted using heuristics that can be difficult to tune. We propose a Control Barrier Function (CBF)-informed reward design for Multi-Agent RL…

机器人学 · 计算机科学 2026-05-19 Jianye Xu , Bassam Alrifaee

SADP: Subgoal-Aware Diffusion Policy for Explainable Robots Learned from Foundation Model Generated Demonstrations

Explainable robots require not only successful task execution but also the ability to expose internal decision-making process in a user-friendly manner. However, most imitation learning methods are trained solely on task-level…

机器人学 · 计算机科学 2026-05-19 Site Hu , Takato Horii

SSTL: Self-Sensing Tendon Loop for Hysteresis Modeling and Compensation in Tendon-Sheath Mechanisms

Flexible endoscopic robots enable minimally invasive access through natural orifices, but their control accuracy is limited by configuration-dependent hysteresis in the tendon-sheath mechanisms (TSMs). Tendon-sheath friction and tendon…

机器人学 · 计算机科学 2026-05-19 Myeongbo Park , Junhyun Park , Ihsan Ullah , Chunggil An , Minho Hwang

Plan First, Diffuse Later: Extrinsic Graph Guidance for Long-Horizon Diffusion Planning

Compositional diffusion models offer a promising route to long-horizon planning by denoising multiple overlapping sub-trajectories while ensuring that together they constitute a global solution. However, enforcing local behavior over long…

机器人学 · 计算机科学 2026-05-19 Yaniv Hassidof , Adir Morgan , Yilun Du , Kiril Solovey

Pedestrian-Aware LLM-Driven Behavioral Planning for Autonomous Vehicles

Autonomous Vehicles (AVs) must make reliable decisions in dense urban environments where pedestrian behavior is variable, sometimes abnormal, and often unseen during training. Reinforcement learning (RL)-based AV control systems perform…

机器人学 · 计算机科学 2026-05-19 Aidana Baimbetova , Haruki Yonekura , Hamada Rizk , Hirozumi Yamaguchi

"I'm Not Mad, Just Focused'': Understanding Human Emotions in Human-Robot Collaboration

Human-robot collaboration (HRC) can benefit from robots' abilities to interpret human emotional states. However, current emotion recognition (ER) models in HRC often fall short, particularly due to their reliance on acted datasets and…

机器人学 · 计算机科学 2026-05-19 Seung Chan Hong , Dana Kulić , Leimin Tian