机器人学 — Scifaro

VEGA: Visual Encoder Grounding Alignment for Spatially-Aware Vision-Language-Action Models

Precise spatial reasoning is fundamental to robotic manipulation, yet the visual backbones of current vision-language-action (VLA) models are predominantly pretrained on 2D image data without explicit 3D geometric supervision, resulting in…

机器人学 · 计算机科学 2026-05-12 Hao Wang , Xiaobao Wei , Jingyang He , Chengyu Bai , Chun-Kai Fan , Jiajun Cao , Jintao Chen , Ying Li , Shanyu Rong , Ming Lu , Xiaozhu Ju , Jian Tang , Shanghang Zhang

Learning Point Cloud Geometry as a Statistical Manifold: Theory and Practice

Point clouds are a fundamental representation for robotic perception tasks such as localization, mapping, and object pose estimation. However, LiDAR-acquired point clouds are inherently sparse and non-uniform, providing incomplete…

机器人学 · 计算机科学 2026-05-12 Jinwoo Lee , Jiwoo Kim , Woojae Shin , Giseop Kim , Hyondong Oh

Nano-U: Efficient Terrain Segmentation for Tiny Robot Navigation

Terrain segmentation is a fundamental capability for autonomous mobile robots operating in unstructured outdoor environments. However, state-of-the-art models are incompatible with the memory and compute constraints typical of…

机器人学 · 计算机科学 2026-05-12 Federico Pizzolato , Francesco Pasti , Nicola Bellotto

Data-Asymmetric Latent Imagination and Reranking for 3D Robotic Imitation Learning

Robotic imitation learning typically assumes access to optimal demonstrations, yet real-world data collection often yields suboptimal, exploratory, or even failed trajectories. Discarding such data wastes valuable information about…

机器人学 · 计算机科学 2026-05-12 Lianghao Luo , Xizhou Bu , Ruyan Liu , Qingqiu Huang , Chufeng Tang , Xiaoshuai Hao , Hongbo Wang , Wei Li

Plan in Sandbox, Navigate in Open Worlds: Learning Physics-Grounded Abstracted Experience for Embodied Navigation

Vision-Language Models (VLMs) have demonstrated exceptional general reasoning capabilities. However, their performance in embodied navigation remains hindered by a scarcity of aligned open-world vision and robot control data. Despite…

机器人学 · 计算机科学 2026-05-12 Zhixuan Shen , Jiawei Du , Ziyu Guo , Han Luo , Lilan Peng , Joey Tianyi Zhou , Haonan Luo , Tianrui Li

A cell-decomposition based path planner for 3D navigation in constrained workspaces

This paper proposes a cell decomposition algorithm for binary occupancy grids that ensures mutual complete visibility from each cell to at least one adjacent cell. This decomposition establishes a simplified framework for verifying path…

机器人学 · 计算机科学 2026-05-12 João P. L. Morais , Luciano C. A. Pimenta , Marcelo A. Santos , Guilherme V. Raffo

EFGCL: Learning Dynamic Motion through Spotting-Inspired External Force Guided Curriculum Learning

Learning dynamic whole-body motions for legged robots through reinforcement learning (RL) remains challenging due to the high risk of failure, which makes efficient exploration difficult and often leads to unstable learning. In this paper,…

机器人学 · 计算机科学 2026-05-12 Keita Yoneda , Kento Kawaharazuka , Kei Okada

Guided Streaming Stochastic Interpolant Policy

Inference-time guidance is essential for steering generative robot policies toward dynamic objectives without retraining, yet existing methods are largely confined to chunk-based architectures that exhibit high latency and lack the…

机器人学 · 计算机科学 2026-05-12 Puming Jiang , Meiyi Wang , Kelvin Lin , Ce Hao , Harold Soh

Beyond Self-Play and Scale: A Behavior Benchmark for Generalization in Autonomous Driving

Recent Autonomous Driving (AD) works such as GigaFlow and PufferDrive have unlocked Reinforcement Learning (RL) at scale as a training strategy for driving policies. Yet such policies remain disconnected from established benchmarks, leaving…

机器人学 · 计算机科学 2026-05-12 Aron Distelzweig , Faris Janjoš , Andreas Look , Anna Rothenhäusler , Daniel Jost , Oliver Scheel , Raghu Rajan , Daphne Cornelisse , Eugene Vinitsky , Joschka Boedecker

Muninn: Your Trajectory Diffusion Model But Faster

Diffusion-based trajectory planners can synthesize rich, multimodal robot motions, but their iterative denoising makes online planning and control prohibitively slow. Existing accelerations either modify the sampler or compress the…

机器人学 · 计算机科学 2026-05-12 Gokul Puthumanaillam , Hao Jiang , Ruben Hernandez , Jose Fuentes , Paulo Padrao , Leonardo Bobadilla , Melkior Ornik

StereoPolicy: Improving Robotic Manipulation Policies via Stereo Perception

Recent advances in robot imitation learning have yielded powerful visuomotor policies capable of manipulating a wide variety of objects directly from monocular visual inputs. However, monocular observations inherently lack reliable depth…

机器人学 · 计算机科学 2026-05-12 Evans Han , Yunfan Jiang , Yingke Wang , Haoyue Xiao , Huang Huang , Jianwen Xie , Jiajun Wu , Li Fei-Fei , Ruohan Zhang

HiDrive: A Closed-Loop Benchmark for High-Level Autonomous Driving

End-to-end autonomous driving has witnessed rapid progress, yet existing benchmarks are increasingly saturated, with state-of-the-art models achieving near-perfect scores on widely used open-loop and closed-loop benchmarks. This saturation…

机器人学 · 计算机科学 2026-05-12 Zhongyu Xia , Guanyu Zhu , Guo Tang , Wenhao Chen , Yongtao Wang

JODA: Composable Joint Dynamics for Articulated Objects

Articulated objects used in simulation and embodied AI are typically specified by geometry and kinematic structure, but lack the fine-grained dynamical effects that govern realistic mechanical behavior, such as frictional holding, detents,…

机器人学 · 计算机科学 2026-05-12 Tianhong Gao , Cheng Yu , Yinghao Xu , Mengyu Chu

Explicit Stair Geometry Conditioning for Robust Humanoid Locomotion

Robust humanoid stair climbing remains challenging due to geometric discontinuities, sensitivity to step height variations, and perception uncertainty in real-world environments. Existing learning-based locomotion policies often rely on…

机器人学 · 计算机科学 2026-05-12 Jianguo Zhang , Wentai Xu , Shusheng Ye , Yuxiang He , Weimin Qi , Qinbo Sun , Ning Ding , Liguang Zhou

Neural Distance-Guided Path Integral Control for Tractor-Trailer Navigation

Autonomous and safe navigation of tractor-trailer systems requires accurate, real-time collision avoidance and dynamically feasible control, particularly in cluttered and complex agricultural environments. This is challenging due to their…

机器人学 · 计算机科学 2026-05-12 Peng Wei , Chen Peng , Stavros Vougioukas

Network-Efficient World Model Token Streaming

Generative driving world models rely on compact latent state representations that must be efficiently transmitted and synchronized across distributed compute and connected vehicles. We study network-efficient streaming of a discrete world…

机器人学 · 计算机科学 2026-05-12 Shatadal Mishra , Ahmadreza Moradipari , Nejib Ammar

Above and Below: Heterogeneous Multi-robot SLAM Across Surface and Underwater Domains

Multi-robot simultaneous localization and mapping (SLAM) is a fundamental task in multi-robot operations. Robots must have a common understanding of their location and that of their team members to complete coordinated actions. However,…

机器人学 · 计算机科学 2026-05-12 John McConnell , Armon Shariati , Paul Szenher , Yaxuan Li

Efficient Multi-Robot Motion Planning with Precomputed Translation-Invariant Edge Bundles

Solving multi-robot motion planning (MRMP) requires generating collision-free kinodynamically feasible trajectories for multiple interacting robots. We introduce Kinodynamic Translation-Invariant Edge Bundles or KiTE-Extend, a…

机器人学 · 计算机科学 2026-05-12 Himanshu Gupta , Paul Motter , Aritra Chakrabarty , Rishabh Sodani , Srikrishna Bangalore Raghu , Alessandro Roncone , Bradley Hayes , Zachary Sunberg

Zero-Shot Sim-to-Real Robot Learning: A Dexterous Manipulation Study on Reactive Catching

Dexterous manipulation is physics-intensive and highly sensitive to modeling errors and perception noise, making sim-to-real transfer prohibitively challenging. Domain randomization (DR) is commonly used to improve the robustness of learned…

机器人学 · 计算机科学 2026-05-12 Kejia Ren , Gaotian Wang , Andrew S. Morgan , Kaiyu Hang

MVB-Grasp: Minimum-Volume-Box Filtering of Diffusion-based Grasps for Frontal Manipulation

State-of-the-art 6-DoF grasp generators excel on tabletop benchmarks with overhead cameras but struggle in frontal grasping scenarios on low-cost manipulators with constrained workspaces, where kinematic limits and approach-direction…

机器人学 · 计算机科学 2026-05-12 Bibek Poudel , Abdul Basit , Muhammad Shafique