机器人学 — Scifaro

TiROD: Tiny Robotics Dataset and Benchmark for Continual Object Detection

Detecting objects with visual sensors is crucial for numerous mobile robotics applications, from autonomous navigation to inspection. However, robots often need to operate under significant domains shifts from those they were trained in,…

机器人学 · 计算机科学 2026-03-20 Francesco Pasti , Riccardo De Monte , Davide Dalle Pezze , Gian Antonio Susto , Nicola Bellotto

PLM-Net: Perception Latency Mitigation Network for Vision-Based Lateral Control of Autonomous Vehicles

This study introduces the Perception Latency Mitigation Network (PLM-Net), a modular deep learning framework designed to mitigate perception latency in vision-based imitation-learning lane-keeping systems. Perception latency, defined as the…

机器人学 · 计算机科学 2026-03-20 Aws Khalil , Jaerock Kwon

A Single-Fiber Optical Frequency Domain Reflectometry (OFDR)-Based Shape Sensing of Concentric Tube Steerable Drilling Robots

This paper introduces a novel shape-sensing approach for Concentric Tube Steerable Drilling Robots (CT-SDRs) based on Optical Frequency Domain Reflectometry (OFDR). Unlike traditional FBG-based methods, OFDR enables continuous strain…

机器人学 · 计算机科学 2026-03-19 Yash Kulkarni , Mobina Tavangarifard , Daniyal Maroufi , Mohsen Khadem , Justin E. Bird , Jeffrey H. Siewerdsen , Farshid Alambeigi

Specification-Aware Distribution Shaping for Robotics Foundation Models

Robotics foundation models have demonstrated strong capabilities in executing natural language instructions across diverse tasks and environments. However, they remain largely data-driven and lack formal guarantees on safety and…

机器人学 · 计算机科学 2026-03-19 Sadık Bera Yüksel , Derya Aksaray

DexViTac: Collecting Human Visuo-Tactile-Kinematic Demonstrations for Contact-Rich Dexterous Manipulation

Large-scale, high-quality multimodal demonstrations are essential for robot learning of contact-rich dexterous manipulation. While human-centric data collection systems lower the barrier to scaling, they struggle to capture the tactile…

机器人学 · 计算机科学 2026-03-19 Xitong Chen , Yifeng Pan , Min Li , Xiaotian Ding

ProbeFlow: Training-Free Adaptive Flow Matching for Vision-Language-Action Models

Recent Vision-Language-Action (VLA) models equipped with Flow Matching (FM) action heads achieve state-of-the-art performance in complex robot manipulation. However, the multi-step iterative ODE solving required by FM introduces inference…

机器人学 · 计算机科学 2026-03-19 Zhou Fang , Jiaqi Wang , Yi Zhou , Qiongfeng Shi

Huddle: Parallel Shape Assembly using Decentralized, Minimalistic Robots

We propose a novel algorithm for forming arbitrarily shaped assemblies using decentralized robots. By relying on local interactions, the algorithm ensures there are no unreachable states or gaps in the assembly, which are global properties.…

机器人学 · 计算机科学 2026-03-19 Khai Yi Chin , Tingwei Meng , Zhe Chen , Daniel Bassett , Yuri Ivanov

VolumeDP: Modeling Volumetric Representation for Manipulation Policy Learning

Imitation learning is a prominent paradigm for robotic manipulation. However, existing visual imitation methods map 2D image observations directly to 3D action outputs, imposing a 2D-3D mismatch that hinders spatial reasoning and degrades…

机器人学 · 计算机科学 2026-03-19 Tianxing Zhou , Feiyang Xue , Zhangchen Ye , Tianyuan Yuan , Hang Zhao , Tao Jiang

AERR-Nav: Adaptive Exploration-Recovery-Reminiscing Strategy for Zero-Shot Object Navigation

Zero-Shot Object Navigation (ZSON) in unknown multi-floor environments presents a significant challenge. Recent methods, mostly based on semantic value greedy waypoint selection, spatial topology-enhanced memory, and Multimodal Large…

机器人学 · 计算机科学 2026-03-19 Jingzhi Huang , Junkai Huang , Haoyang Yang , Haoang Li , Yi Wang

Consistency-Driven Dual LSTM Models for Kinematic Control of a Wearable Soft Robotic Arm

In this paper, we introduce a consistency-driven dual LSTM framework for accurately learning both the forward and inverse kinematics of a pneumatically actuated soft robotic arm integrated into a wearable device. This approach effectively…

机器人学 · 计算机科学 2026-03-19 Xingyu Chen , Yi Xiong , Li Wen

AgentVLN: Towards Agentic Vision-and-Language Navigation

Vision-and-Language Navigation (VLN) requires an embodied agent to ground complex natural-language instructions into long-horizon navigation in unseen environments. While Vision-Language Models (VLMs) offer strong 2D semantic understanding,…

机器人学 · 计算机科学 2026-03-19 Zihao Xin , Wentong Li , Yixuan Jiang , Ziyuan Huang , Bin Wang , Piji Li , Jianke Zhu , Jie Qin , Shengjun Huang

REAL: Robust Extreme Agility via Spatio-Temporal Policy Learning and Physics-Guided Filtering

Extreme legged parkour demands rapid terrain assessment and precise foot placement under highly dynamic conditions. While recent learning-based systems achieve impressive agility, they remain fundamentally fragile to perceptual degradation,…

机器人学 · 计算机科学 2026-03-19 Jialong Liu , Dehan Shen , Yanbo Wen , Zeyu Jiang , Changhao Chen

VectorWorld: Efficient Streaming World Model via Diffusion Flow on Vector Graphs

Closed-loop evaluation of autonomous-driving policies requires interactive simulation beyond log replay. However, existing generative world models often degrade in closed loop due to (i) history-free initialization that mismatches policy…

机器人学 · 计算机科学 2026-03-19 Chaokang Jiang , Desen Zhou , Jiuming Liu , Kevin Li Sun

KineVLA: Towards Kinematics-Aware Vision-Language-Action Models with Bi-Level Action Decomposition

In this paper, we introduce a novel kinematics-rich vision-language-action (VLA) task, in which language commands densely encode diverse kinematic attributes (such as direction, trajectory, orientation, and relative displacement) from…

机器人学 · 计算机科学 2026-03-19 Gaoge Han , Zhengqing Gao , Ziwen Li , Jiaxin Huang , Shaoli Huang , Fakhri Karray , Mingming Gong , Tongliang Liu

Bringing Network Coding into Multi-Robot Systems: Interplay Study for Autonomous Systems over Wireless Communications

Communication is a core enabler for multi-robot systems (MRS), providing the mechanism through which robots exchange state information, coordinate actions, and satisfy safety constraints. While many MRS autonomy algorithms assume reliable…

机器人学 · 计算机科学 2026-03-19 Anil Zaher , Kiril Solovey , Alejandro Cohen

P$^{3}$Nav: End-to-End Perception, Prediction and Planning for Vision-and-Language Navigation

In Vision-and-Language Navigation (VLN), an agent is required to plan a path to the target specified by the language instruction, using its visual observations. Consequently, prevailing VLN methods primarily focus on building powerful…

机器人学 · 计算机科学 2026-03-19 Tianfu Li , Wenbo Chen , Haoxuan Xu , Xinhu Zheng , Haoang Li

FloorPlan-VLN: A New Paradigm for Floor Plan Guided Vision-Language Navigation

Existing Vision-Language Navigation (VLN) task requires agents to follow verbose instructions, ignoring some potentially useful global spatial priors, limiting their capability to reason about spatial structures. Although human-readable…

机器人学 · 计算机科学 2026-03-19 Kehan Chen , Yan Huang , Dong An , Jiawei He , Yifei Su , Jing Liu , Nianfeng Liu , Liang Wang

SafeLand: Safe Autonomous Landing in Unknown Environments with Bayesian Semantic Mapping

Autonomous landing of uncrewed aerial vehicles (UAVs) in unknown, dynamic environments poses significant safety challenges, particularly near people and infrastructure, as UAVs transition to routine urban and rural operations. Existing…

机器人学 · 计算机科学 2026-03-19 Markus Gross , Andreas Greiner , Sai Bharadhwaj Matha , Felix Soest , Daniel Cremers , Henri Meeß

Physics-informed Deep Mixture-of-Koopmans Vehicle Dynamics Model with Dual-branch Encoder for Distributed Electric-drive Trucks

Advanced autonomous driving systems require accurate vehicle dynamics modeling. However, identifying a precise dynamics model remains challenging due to strong nonlinearities and the coupled longitudinal and lateral dynamic characteristics.…

机器人学 · 计算机科学 2026-03-19 Jinyu Miao , Pu Zhang , Rujun Yan , Yifei He , Bowei Zhang , Zheng Fu , Ke Wang , Qi Song , Kun Jiang , Mengmeng Yang , Diange Yang

OmniVLN: Omnidirectional 3D Perception and Token-Efficient LLM Reasoning for Visual-Language Navigation across Air and Ground Platforms

Language-guided embodied navigation requires an agent to interpret object-referential instructions, search across multiple rooms, localize the referenced target, and execute reliable motion toward it. Existing systems remain limited in real…

机器人学 · 计算机科学 2026-03-19 Zhongyuang Liu , Min He , Shaonan Yu , Xinhang Xu , Muqing Cao , Jianping Li , Jianfei Yang , Lihua Xie