Robotics - Scifaro

AERIS: Aerial-Edge Role-Driven Intelligence at Runtime via Orchestrated Language-Model Swarm

Integrating large language models into robotic systems holds promise for enhancing autonomy, yet practical deployment remains limited by strict heartbeat-constrained scheduling and scarce computational power. We propose AERIS: an edge…

Robotics · Computer Science 2026-06-29 Jiabin Lou, Haopeng Wang, Xinyu Liu, Yu Zhang, Rongye Shi, Wenjun Wu

SA-VLA: State-aware tokenizer for improving Vision-Language-Action Models' performance

Discrete action tokenization provides a compact interface for autoregressive VLA policies, but accurately recovering continuous robot actions from discrete codes remains challenging. Existing tokenizers typically map each discrete code to a…

Robotics · Computer Science 2026-06-29 Tengyue Jiang, Chunpu Xu, Jiayue Kang, Yao Mu

Automating the Design of Embodied Agent Architectures

Embodied agents are typically built as hand-designed compositions of perception, memory, planning, and action modules. This modularity exposes a large architectural design space, but current systems still rely on researcher intuition to…

Robotics · Computer Science 2026-06-29 Jian Zhou, Sihao Lin, Jin Li, Shuai Fu, Gengze Zhou, Qi Wu

TacEvo: Self-Evolving Architecture Discovery for Robotic Tactile Perception via LLM-Driven Quality-Diversity Search

Vision-based tactile sensing converts contact-induced surface deformation into images, enabling robots to infer contact forces and fine surface textures that are not accessible through conventional vision alone. However, tactile images are…

Robotics · Computer Science 2026-06-29 Mohammed AbuSadeh, Lan Wei, Dandan Zhang

SIR: Structured Image Representations for Explainable Robot Learning

Existing robot policies based on learned visual embeddings lack explicit structure and are sensitive to visual distractions. Thus, the representations that drive their behaviour are often opaque, making their decision-making process…

Robotics · Computer Science 2026-06-29 Paul Mattes, Jan Schwab, Jens Bosch, Nils Blank, Maximilian Xiling Li, Minh-Trung Tang, Moritz Haberland, Rudolf Lioutikov

Heterogeneous Tactile Transformer

Tactile sensors are inherently heterogeneous: a model trained on one sensor cannot be directly used on another, which limits learning contact-rich manipulation policies from diverse tactile data at scale. To bridge this gap, we propose the…

Robotics · Computer Science 2026-06-29 Jianxin Bi, Qiang Wang, Jayaram Reddy, Kelvin Lin, Soibkhon Khajikhanov, Ruihan Gao, Harold Soh

Seeing Touch from Motion: A Unified Modality-Aware Visuo-Tactile Policy with Tactile Motion Correlation

Visuo-Tactile policies leveraging optical tactile sensors have shown great promise in contact-rich manipulation. These sensors achieve high spatial resolution and multi-dimensional force sensing by utilizing an internal camera to monitor…

Robotics · Computer Science 2026-06-29 Shengqi Xu, Guojin Zhong, Yang Liu, Fanjie Wang, Hu Luo, Hanyu Zhou, Weiyao Zhang, Ziyi Ye, Zuxuan Wu, Yu-Gang Jiang

WARP: Whole-Body Retargeting for Learning from Offline Human Demonstrations

Direct transfer from human demonstration to learnable robot action is a crucial step towards scalable whole-body mobile manipulation. While human data scales better than mobile teleoperation, it requires overcoming significant embodiment…

Robotics · Computer Science 2026-06-29 Zhenyang Chen, Chuizheng Kong, Chuye Zhang, Yuanshao Yang, Lawrence Y. Zhu, Shreyas Kousik, Danfei Xu

REPAIR-Bench: A Benchmark for Robot Error Perception And Interaction Recovery

Understanding how users perceive and respond to robot failures is essential for building robust and trustworthy robot systems. Prior work, however, (i) often treats failures as independent events, (ii) emphasizes binary failure detection,…

Robotics · Computer Science 2026-06-29 Giuliano Pioldi, Yashika Batra, Arman Ibrayeva, Yuanchen Bai, Purnjay Maruur, Promise Ekpo, Angelique Taylor

OpenSPM: An Environment-Transferable Robotic Key Spatial Pose Memory and Closed-Loop High-Frequency Flow-Matching Action Generation Model

Open-environment tabletop robotic manipulation requires systems to possess semantic understanding, precise geometric pose estimation, and high-frequency action generation. While end-to-end vision-language-action (VLA) models excel at…

Robotics · Computer Science 2026-06-29 Iok Tong Lei, Qingchen Xie, Yifan Wang, Yap Ying Jie, Zhidong Deng

RoamFlow: Reinforcement-Aligned One-Step Action MeanFlow Policy for Image-Goal Navigation

Image-goal navigation is a key challenge in embodied robotics, where an agent must reach a target specified solely by a goal image. While existing reinforcement learning approaches map perceptual observations directly to actions, they…

Robotics · Computer Science 2026-06-29 Zixuan Zhang, Yuqi Chen, Junjie Gao, Siyuan Song, Yongzhou Pan, Beichen Wang, Mir Feroskhan

Flying to Image-Specified Objects: 3D Quadrotor Navigation via Cross-Graph Memory and Viewpoint Planning

Instance-Specific Image-Goal Navigation (InstanceImageNav) requires a robot to navigate toward the exact object instance depicted in a query image. Extending this task to quadrotors is challenging due to continuous 3D control, limited field…

Robotics · Computer Science 2026-06-29 Junjie Gao, Yuqi Chen, Yongzhou Pan, Yaosheng Deng, Jiaping Xiao, Mir Feroskhan

Sphere-VIO: Fast and Robust Visual-Inertial Odometry via Unified Spherical Representation for Heterogeneous Multi-Camera Systems

Multi-camera visual-inertial odometry (VIO) overcomes the inherent limitations of pure visual systems by expanding the field of view. However, existing algorithms are typically tailored for fixed camera setups and lack unified compatibility…

Robotics · Computer Science 2026-06-29 Yueteng Yang, Yusen Xie, Hao Wei, Qianhao Wang, Boyu Zhou, Fei Gao, Jun Ma, Jinni Zhou

Pondering the Way: Spatial-perceiving World Action Model for Embodied Navigation

Existing world model-based planners for visual navigation typically follow a verification-centric paradigm, decoupling goal intent from trajectory synthesis. This approach suffers from candidate dependence, heavy computational overhead, and…

Robotics · Computer Science 2026-06-29 Hong Chen, Daqi Liu, Zehan Zhang, Haiguang Wang, Tianhao Lu, Longfei Yan, Haiyang Sun, Fangzhen Li, Hongwei Xie, Bing Wang, Guang Chen, Hangjun Ye, Yihua Tan

Critical Interval MSE: Toward Reliable Offline Validation for Robot Manipulation Policies

Real-world evaluation is the gold standard for robot policies because it tests them against the physical conditions and deployment challenges they are ultimately designed to handle. However, real-world evaluation is also the bottleneck for…

Robotics · Computer Science 2026-06-29 Haoxu Huang, Tongsam Zheng, Yifan Chen, Jiacheng You, Yang Gao

Trust Your Instincts: Confidence-Driven Test-Time RL for Vision-Language-Action Models

Reinforcement learning (RL) has become indispensable for pushing Vision-Language-Action Models (VLAs) beyond static imitation learning. However, existing RL methods typically require external environmental feedback, relying on predefined…

Robotics · Computer Science 2026-06-29 Siyao Chen, Jiakang Yuan, Jiaxin Wang, Tao Chen

AUSLUN: A Fixed-Hover UAV-USV System for GNSS-Denied Maritime Search and Navigation

Global navigation satellite system (GNSS) denial can prevent an unmanned surface vehicle (USV) from both finding a distant vessel and maintaining a globally referenced approach. This paper presents AUSLUN (Automatic UAV Search,…

Robotics · Computer Science 2026-06-29 Siyuan Yang, Zikai Jia, Hailiang Kuang, Xiaoyu He, Qizhi Guo, Yihao Dong, Shaoming He

Normalizing Flow-Enhanced Message Passing for Multirobot Collaborative Localization

Accurate, robust, and adaptive localization is essential for various robotic operations. This paper proposes a new message passing (MP) algorithm for realizing collaborative localization in a distributed manner. The algorithm unifies…

Robotics · Computer Science 2026-06-29 Han Shen, Guanghui Wen, Liangming Chen, Ming Cao

TACO: A Test and Check Framework for Robust Pose Graph Optimization

Pose Graph Optimization (PGO) is one of the most widely adopted approaches for solving Simultaneous Localization and Mapping (SLAM) problems. However, PGO approaches are particularly sensitive to outliers, which can substantially degrade…

Robotics · Computer Science 2026-06-29 Emilio Olivastri, Alberto Pretto, Tobias Fischer

Legible Shared Autonomy: Implicit Communication of Robot Belief through Motion

Shared autonomy systems combine user input with autonomous assistance to help users with motor impairments control robot arms to perform everyday manipulation tasks, by inferring user goals and providing appropriate guidance. However, the…

Robotics · Computer Science 2026-06-29 Jinwei Liu, Pengfei Li, Shaofeng Chen, Tao Wang, Yun-Bo Zhao