机器人学 — Scifaro

V-VLAPS: Value-Guided Planning for Vision-Language-Action Models

Vision-language-action (VLA) models provide strong action priors for robotic manipulation, but their reactive behavior can fail under distribution shift and long-horizon task structure. Recent VLA-guided planning methods improve execution…

机器人学 · 计算机科学 2026-05-25 Ke Ren , Ali Salamatian , Kieran Pattison , Cyrus Neary

CarlaNCAP: A Framework for Quantifying the Safety of Vulnerable Road Users in Infrastructure-Assisted Collective Perception Using EuroNCAP Scenarios

The growing number of road users has significantly increased the risk of accidents in recent years. Vulnerable Road Users (VRUs) are particularly at risk, especially in urban environments where they are often occluded by parked vehicles or…

机器人学 · 计算机科学 2026-05-25 Jörg Gamerdinger , Sven Teufel , Simon Roller , Oliver Bringmann

LACY: A Vision-Language Model-based Language-Action Cycle for Self-Improving Robotic Manipulation

Learning generalizable policies for robotic manipulation increasingly relies on large-scale models that map language instructions to actions (L2A). However, this one-way paradigm often produces policies that execute tasks without deeper…

机器人学 · 计算机科学 2026-05-25 Youngjin Hong , Houjian Yu , Mingen Li , Changhyun Choi

USIM and U0: A Vision-Language-Action Dataset and Model for General Underwater Robots

Underwater environments pose unique challenges for robotic navigation and manipulation. While existing research has primarily focused on task-specific methods, studies on general-purpose intelligence for multi-task execution remain scarce.…

机器人学 · 计算机科学 2026-05-25 Junwen Gu , Zhiheng Wu , Pengxuan Si , Shuang Qiu , Zhentao Zhang , Yukai Feng , Luoyang Sun , Laien Luo , Lianyi Yu , Jian Wang , Zhengxing Wu

Talk Less, Fly Lighter: Autonomous Semantic Compression for UAV Swarm Communication via LLMs

The rapid adoption of Large Language Models (LLMs) in unmanned systems has significantly enhanced the semantic understanding and autonomous task execution capabilities of Unmanned Aerial Vehicle (UAV) swarms. However, limited communication…

机器人学 · 计算机科学 2026-05-25 Fei Lin , Tengchao Zhang , Qinghua Ni , Jun Huang , Siji Ma , Yonglin Tian , Yisheng Lv , Naiqi Wu

A Reconfigured Wheel-Legged Robot for Enhanced Steering and Adaptability

Wheel-legged robots integrate leg agility on rough terrain with wheel efficiency on flat ground. However, most existing designs do not fully capitalize on the benefits of both legged and wheeled structures, which limits overall system…

机器人学 · 计算机科学 2026-05-25 Zhicheng Song , Jinglan Xu , Chunxin Zheng , Yulin Li , Zhihai Bi , Jun Ma

GAF: Gaussian Action Field as a 4D Representation for Dynamic World Modeling in Robotic Manipulation

Accurate scene perception is critical for vision-based robotic manipulation. Existing approaches typically follow either a Vision-to-Action (V-A) paradigm, predicting actions directly from visual inputs, or a Vision-to-3D-to-Action (V-3D-A)…

机器人学 · 计算机科学 2026-05-25 Ying Chai , Litao Deng , Ruizhi Shao , Jiajun Zhang , Kangchen Lv , Liangjun Xing , Xiang Li , Hongwen Zhang , Yebin Liu

Using Ensemble Diffusion to Estimate Uncertainty for End-to-End Autonomous Driving

End-to-end planning systems for autonomous driving are rapidly improving, especially in closed-loop simulation environments like CARLA. Many such driving systems either do not consider uncertainty as part of the plan itself or obtain it by…

机器人学 · 计算机科学 2026-05-25 Florian Wintel , Sigmund H. Høeg , Gabriel Kiss , Frank Lindseth

AirVista-II: An Agentic System for Embodied UAVs Toward Dynamic Scene Semantic Understanding

Unmanned Aerial Vehicles (UAVs) are increasingly important in dynamic environments such as logistics transportation and disaster response. However, current tasks often rely on human operators to monitor aerial videos and make operational…

机器人学 · 计算机科学 2026-05-25 Fei Lin , Yonglin Tian , Tengchao Zhang , Jun Huang , Sangtian Guan , Fei-Yue Wang

Learning Scene-Level Signed Directional Distance Function with Ellipsoidal Priors and Neural Residuals

Dense reconstruction and differentiable rendering are fundamental tightly connected operations in 3D vision and computer graphics. Recent neural implicit representations demonstrate compelling advantages in reconstruction fidelity and…

机器人学 · 计算机科学 2026-05-25 Zhirui Dai , Hojoon Shin , Yulun Tian , Ki Myung Brian Lee , Nikolay Atanasov

Neural Configuration-Space Barriers for Manipulation Planning and Control

Planning and control for high-dimensional robot manipulators in cluttered dynamic environments require computational efficiency and robust safety guarantees. Inspired by recent advances in learning configuration-space distance functions…

机器人学 · 计算机科学 2026-05-25 Kehan Long , Ki Myung Brian Lee , Nikola Raicevic , Niyas Attasseri , Melvin Leok , Nikolay Atanasov

Data-driven Spatial Classification using Multi-Arm Bandits for Monitoring with Energy-Constrained Mobile Robots

We consider the spatial classification problem for monitoring using data collected by a coordinated team of mobile robots. Such classification problems arise in several applications including search-and-rescue and precision agriculture.…

机器人学 · 计算机科学 2026-05-25 Xiaoshan Lin , Siddharth Nayak , Stefano Di Cairano , Abraham P. Vinod

AwareVLN: Reasoning with Self-awareness for Vision-Language Navigation

Vision-and-Language Navigation (VLN) requires an agent to ground language instructions to its own movement within a visual environment. While state-of-the-art methods leverage the reasoning capabilities of Vision-Language Models (VLMs) for…

机器人学 · 计算机科学 2026-05-22 Wenxuan Guo , Xiuwei Xu , Yichen Liu , Xiangyu Li , Hang Yin , Huangxing Chen , Wenzhao Zheng , Jianjiang Feng , Jie Zhou , Jiwen Lu

GesVLA: Gesture-Aware Vision-Language-Action Model Embedded Representations

Vision-Language-Action (VLA) models have shown strong potential for general-purpose robot manipulation by unifying perception and action. However, existing VLA systems primarily rely on textual instructions and struggle to resolve spatial…

机器人学 · 计算机科学 2026-05-22 Wenxuan Guo , Ziyuan Li , Meng Zhang , Yichen Liu , Yimeng Dong , Chuxi Xu , Yunfei Wei , Ze Chen , Erjin Zhou , Jianjiang Feng

Superhuman Safe and Agile Racing through Multi-Agent Reinforcement Learning

Autonomous systems have achieved superhuman performance in isolation or simulation, yet they remain brittle in shared, dynamic real-world spaces. This failure stems from the dominant single-agent paradigm for physical applications, where…

机器人学 · 计算机科学 2026-05-22 Ismail Geles , Leonard Bauersfeld , Markus Wulfmeier , Davide Scaramuzza

N3P: Accelerated Automated Parking via a Learning-Based Naturalistic Three-Stage Scheme

Autonomous parking requires efficient path planning that ensures kinematic feasibility and collision avoidance in constrained environments. Hybrid A* is widely used but computationally expensive, while reinforcement learning (RL) methods…

机器人学 · 计算机科学 2026-05-22 Yifan Xue , Toktam Mohammadnejad , Faizan M Tariq , Sangjae Bae , David Isele , Yosuke Sakamoto , Nadia Figueroa , Jovin D'sa

Scout-Assisted Planning for Heterogeneous Robot Teams under Partially Known Environments

Autonomous robot teams navigating partially known environments face costly backtracking when ground robots encounter blocked roads that are only revealed upon physical traversal. We address this with Scout-Assisted Planning, a heterogeneous…

机器人学 · 计算机科学 2026-05-22 Hoang-Dung Bui , Abhish Khanal , Raihan Islam Arnob , Gregory J. Stein

Symmetries Here and There, Combined Everywhere: Cross-space Symmetry Compositions in Robotics

Robots exhibit a rich variety of symmetries arising from their mechanical structure and the properties of their tasks. Although many robotics problems exhibit several symmetries simultaneously, existing approaches typically treat them in…

机器人学 · 计算机科学 2026-05-22 Loizos Hadjiloizou , Rodrigo Pérez-Dattari , Noémie Jaquier

SE3Kit: A Lightweight Python Library for Specialized Geometric Primitives in Robotics

The Python robotics ecosystem faces a challenge: while many libraries exist for rigid body transformations, few are both lightweight and mathematically strict. This paper introduces SE3Kit, a lightweight Python library efficient operations…

机器人学 · 计算机科学 2026-05-22 Daniyal Maroufi , Omid Rezayof , Farshid Alambeigi

Decoupling Ego-Motion from Target Dynamics via Dual-Interval Motion Cues for UAV Detection

Object detection from Unmanned Aerial Vehicles (UAVs) is challenged by severe ego-motion, camera jitter, and large scale variations. While modern detectors perform well on static images, their direct application to UAV video often fails,…

机器人学 · 计算机科学 2026-05-22 Liuyang Wang , Feitian Zhang