机器人学 — Scifaro

POINav: Benchmarking and Enhancing Final-Meters Arrival in Real-World Vision-Language Navigation

Real-world navigation is fundamentally driven by Points of Interest (POIs), yet reaching a precise POI remains a critical "final-meters" challenge. Existing Vision-Language Navigation (VLN) benchmarks of POI-goal navigation often suffer…

机器人学 · 计算机科学 2026-05-28 Ruiyan Gong , Meisheng Zhang , Yuxiang Zhao , Mingchao Sun , Yanfen Shen , Zedong Chu , Zhining Gu , Wei Guo , Xiaolong Cheng , Qiming Li , Kangning Niu , Yanqing Zhu , Xiaolong Wu , Tianlun Li , Mu Xu

ProgVLA: Progress-Aware Robot Manipulation Skill Learning

We present ProgVLA, a compact vision-language-action (VLA) model designed for reliable robot manipulation under tight compute and memory budgets. The model specifically focuses on efficiently processing long multi-modal sequences by…

机器人学 · 计算机科学 2026-05-28 Seungsu Kim , Jinyoung Choi , Seungmin Baek , Jean-Michel Renders

Natural Functional Gradients for Smooth Trajectory Optimization

Generating collision-free and smooth motions remains a central challenge in robotic manipulation, particularly in cluttered environments and narrow passages where feasible regions are highly constrained and fragmented. We propose a…

机器人学 · 计算机科学 2026-05-28 Kisang Park , Chanwoo Kim , Kyungjae Lee , Sungjoon Choi

Visualizing Latent Phase Structures in Locomotion Policies: A Multi-Environment Study with Temporal Feature Extension

Deep reinforcement learning (DRL) has been shown to achieve high performance on locomotion control tasks in MuJoCo benchmarks such as HalfCheetah, Ant, and Walker2D. However, visualizing the motion structures internally obtained by a…

机器人学 · 计算机科学 2026-05-28 Daisuke Yasui , Toshitaka Matuki , Hiroshi Sato

Provably Guaranteed Polytopic Uncertainty Quantification for SLAM

In safety-critical robotics applications, guaranteed and practical uncertainty quantification (UQ) in perception is vital. Many existing works either offer no formal containment guarantee, rely on restrictive modeling assumptions, or focus…

机器人学 · 计算机科学 2026-05-28 Guangyang Zeng , Yulong Gao , Yuan Shen , Lingpeng Chen , Haoying Li , Guodong Shi , Junfeng Wu

STR Robot: Design of an Autonomous Mobile Robot from Simulation to Reality

With the rapid development of simulation tools, the development and validation of autonomous robotic systems have become more efficient before real-world deployment. This paper presents a simulation-to-real implementation of an autonomous…

机器人学 · 计算机科学 2026-05-28 Vinh Nguyen , Gia-Uy Le , Tien-Dat Nguyen , Tri-Tin Nguyen , Vinh-Hao Nguyen

ICAN-Deploy: Identity-Stable Canary Deployment for Safety-Critical Embodied Agents

Canary deployment routes a fraction of traffic to a new software version, monitors metrics, and rolls back on regression. Mainstream controllers (Argo Rollouts, Spinnaker, Flagger) change the deployed system's cryptographic identity during…

机器人学 · 计算机科学 2026-05-28 Xue Qin , Simin Luan , John See , Zeyd Boukhers , Cong Yang , Zhijun Li

An Operator-Based Approach to STL

Signal Temporal Logic (STL), has recently seen extensive development, owing to its rich expressivenes for autonomous planning and control. Nevertheless, existing verification and control synthesis methods are limited with respect to the…

机器人学 · 计算机科学 2026-05-28 Panagiotis Rousseas , Dimos V. Dimarogonas

Whose Is This?: Context-Aware Object Ownership Inference with Uncertainty-Guided Questioning

Service robots must infer object ownership to correctly interpret instructions such as "bring me my cup." However, ownership is a latent attribute that cannot be directly observed, and existing methods often rely on limited cues such as…

机器人学 · 计算机科学 2026-05-28 Saki Hashimoto , Akira Taniguchi , Shoichi Hasegawa , Yoshinobu Hagiwara , Tadahiro Taniguchi

SAFEVPR: Patch-Based Conformal Verification for Safe Cross-Condition Sequence Visual Place Recognition

Sequence-based visual place recognition (VPR) for SLAM and robot relocalization must decide whether the retrieved top-1 candidate is safe to accept. Conformal prediction is a natural framework for this accept/reject decision, but its…

机器人学 · 计算机科学 2026-05-28 Ha Sier , Jiaqiang Zhang , Zhuo Zou , Xianjia Yu , Tomi Westerlund

How Should We Teach Robots? A Comparison of Kinesthetic, Joystick, and Gesture-Based Teaching

Instructing robots from demonstrations can be done through different teaching modalities, each with different usability and performance trade-offs. This paper compares kinesthetic guidance, joystick teleoperation, and hand gestures in a…

机器人学 · 计算机科学 2026-05-28 Petr Vanc , Jan Kristof Behrens , Václav Hlaváč , Karla Stepanova

Simultaneous Contact Selection and Planning for Contact-Rich Manipulation with Cascaded Optimization

We propose an optimization-based framework for robust contact-rich manipulation. Recent contact-implicit methods enable online hybrid planning across contact modes, allowing closed-loop manipulation for a given target state and contact…

机器人学 · 计算机科学 2026-05-28 Zhe Zhang , Xingrong Diao , Haoxiang Liang , Han Yang , Bi-Ke Zhu , Dandan Zhang , Jiankun Wang

VLM-Based Advanced Rider Assistance System for Motorcycle Safety

Motorcycles face disproportionately high crash risks compared to cars due to limited protection and heightened sensitivity to surface hazards, yet Advanced Rider Assistance Systems (ARAS) remain underdeveloped relative to Advanced Driver…

机器人学 · 计算机科学 2026-05-28 Mohamed Elnoor , Francesca Baldini , Ananya Trivedi , Faizan M. Tariq , Jovin D'sa , David Isele , Sangjae Bae , Dinesh Manocha , Yosuke Sakamoto

SANTS: A State-Adaptive Scheduler for World Action Models

World Action Models (WAMs) improve robot manipulation by using video-based future representations to condition action generation. In pixel-space WAMs, however, the best action condition is not necessarily the fully denoised video.…

机器人学 · 计算机科学 2026-05-28 Yirui Sun , Guangyu Zhuge , Keliang Liu , Jie Gu , Xinyu Bing , Zhongxue Gan , Chunxu Tian

Frequency-Guided Action Diffusion via Sub-Frequency Manifold Traversal

Learning visuomotor policies via behavior cloning typically involves mimicking expert demonstrations collected by human operators. However, natural human demonstrations inherently contain high-frequency noise, such as intermittent jerks,…

机器人学 · 计算机科学 2026-05-28 Junlin Wang

A Surveillance Evasion Game with Continuous Sensor Redeployment via Bilevel Optimization

Uncrewed Aerial Systems (UASs) have become a growing threat to the security of critical infrastructure, exploiting spatiotemporal gaps in sensor perimeters to infiltrate restricted airspace undetected. We formulate this interaction as a…

机器人学 · 计算机科学 2026-05-28 Jaehyeok Kim , Kartik A. Pant , Joseph Kinerson , Kylie Sommer-Kohrt , Worawis Sribunma , Li-Yu Lin , James M. Goppert

S-Cheetah: A Novel Quadrupedal Robot with a 3-DOF Active Spine Learning Agile Locomotion

The biological spine of quadrupeds enables sagittal flexion/extension, lateral bending, and axial rotation, playing a crucial role in highly agile and dexterous locomotion. While numerous studies have integrated active spinal joints into…

机器人学 · 计算机科学 2026-05-28 Zimu Li , Weibang Bai

Tabero: Learning Gentle Manipulation with Closed-Loop Force Feedback from Vision, Touch, and Language

Tactile sensing is essential for robots to achieve human-like gentle manipulation. However, existing Vision-Language-Action (VLA) models struggle to exploit tactile feedback for gentle manipulation due to scarce aligned…

机器人学 · 计算机科学 2026-05-28 Qiwei Wu , Rui Zhang , Xin Xiang , Tao Li , Weihua Zhang , Junjie Lai , Renjing Xu

Turning Video Models into Generalist Robot Policies

Video generative models have emerged as a promising robotics backbone, capable of generating videos that depict the completion of complex tasks across embodiments and environments. Recent work proposes robot foundation models that jointly…

机器人学 · 计算机科学 2026-05-28 Sizhe Lester Li , Evan Kim , Xingjian Bai , Tong Zhao , Tao Pang , Max Simchowitz , Vincent Sitzmann

Colosseum V2: Benchmarking Generalization for Vision Language Action Models

Vision-Language-Action (VLA) models demonstrate promising generalization in robotic manipulation, driven by advances in large-scale vision and language pre-training. This progress can be misleading. Despite the zero-shot perception and…

机器人学 · 计算机科学 2026-05-28 Jeremy Morgan , Prajwal Vijay , Hyeonho Oh , Jincen Song , Ashvin Arora , Alina Du , Gaurav Sukhatme , Jesse Thomason , Ishika Singh