机器人学 — Scifaro

Generalizable task-oriented object grasping through LLM-guided ontology and similarity-based planning

Task-oriented grasping (TOG) is more challenging than simple object grasping because it requires precise identification of object parts and careful selection of grasping areas to ensure effective and robust manipulation. While recent…

机器人学 · 计算机科学 2026-03-30 Hao Chen , Takuya Kiyokawa , Weiwei Wan , Kensuke Harada

T-800: An 800 Hz Data Glove for Precise Hand Gesture Tracking

Human dexterity relies on rapid, sub-second motor adjustments, yet capturing these high-frequency dynamics remains an enduring challenge in biomechanics and robotics. Existing motion capture paradigms are compromised by a trade-off between…

机器人学 · 计算机科学 2026-03-30 Haoyang Luo , Zihang Zhao , Leiyao Cui , Saiyao Zhang , Liu Yang , Zhi Han , Xiyuan Tang , Yixin Zhu

Realtime-VLA V2: Learning to Run VLAs Fast, Smooth, and Accurate

In deployment of the VLA models to real-world robotic tasks, execution speed matters. In previous work arXiv:2510.26742 we analyze how to make neural computation of VLAs on GPU fast. However, we leave the question of how to actually deploy…

机器人学 · 计算机科学 2026-03-30 Chen Yang , Yucheng Hu , Yunchao Ma , Yunhuan Yang , Jing Tan , Haoqiang Fan

Optimal Prioritized Dissipation and Closed-Form Damping Limitation under Actuator Constraints for Haptic Interfaces

In haptics, guaranteeing stability is essential to ensure safe interaction with remote or virtual environments. One of the most relevant methods at the state-of-the-art is the Time Domain Passivity Approach (TDPA). However, its high…

机器人学 · 计算机科学 2026-03-30 Camilla Celli , Andrea Bini , Valerio Novelli , Alessandro Filippeschi , Francesco Porcini , Antonio Frisoli

DiffusionAnything: End-to-End In-context Diffusion Learning for Unified Navigation and Pre-Grasp Motion

Efficiently predicting motion plans directly from vision remains a fundamental challenge in robotics, where planning typically requires explicit goal specification and task-specific design. Recent vision-language-action (VLA) models infer…

机器人学 · 计算机科学 2026-03-30 Iana Zhura , Yara Mahmoud , Jeffrin Sam , Hung Khang Nguyen , Didar Seyidov , Miguel Altamirano Cabrera , Dzmitry Tsetserukou

Line-of-Sight-Constrained Multi-Robot Mapless Navigation via Polygonal Visible Regions

Multi-robot systems rely on underlying connectivity to ensure reliable communication and timely coordination. This paper studies the line-of-sight (LoS) connectivity maintenance problem in multi-robot navigation with unknown obstacles.…

机器人学 · 计算机科学 2026-03-30 Ruofei Bai , Shenghai Yuan , Xinhang Xu , Xingyu Ji , Xiaowei Li , Hongliang Guo , Wei-Yun Yau , Lihua Xie

Policy-Guided World Model Planning for Language-Conditioned Visual Navigation

Navigating to a visually specified goal given natural language instructions remains a fundamental challenge in embodied AI. Existing approaches either rely on reactive policies that struggle with long-horizon planning, or employ world…

机器人学 · 计算机科学 2026-03-30 Amirhosein Chahe , Lifeng Zhou

Can Vision Foundation Models Navigate? Zero-Shot Real-World Evaluation and Lessons Learned

Visual Navigation Models (VNMs) promise generalizable, robot navigation by learning from large-scale visual demonstrations. Despite growing real-world deployment, existing evaluations rely almost exclusively on success rate, whether the…

机器人学 · 计算机科学 2026-03-30 Maeva Guerrier , Karthik Soma , Jana Pavlasek , Giovanni Beltrame

Chasing Autonomy: Dynamic Retargeting and Control Guided RL for Performant and Controllable Humanoid Running

Humanoid robots have the promise of locomoting like humans, including fast and dynamic running. Recently, reinforcement learning (RL) controllers that can mimic human motions have become popular as they can generate very dynamic behaviors,…

机器人学 · 计算机科学 2026-03-30 Zachary Olkin , William D. Compton , Ryan M. Bena , Aaron D. Ames

Massive Parallel Deep Reinforcement Learning for Active SLAM

Recent advances in parallel computing and GPU acceleration have created new opportunities for computation-intensive learning problems such as Active SLAM -- where actions are selected to reduce uncertainty and improve joint mapping and…

机器人学 · 计算机科学 2026-03-30 Martín Arce Llobera , Julio A. Placed , Mariano De Paula , Pablo De Cristóforis

ETA-VLA: Efficient Token Adaptation via Temporal Fusion and Intra-LLM Sparsification for Vision-Language-Action Models

The integration of Vision-Language-Action (VLA) models into autonomous driving systems offers a unified framework for interpreting complex scenes and executing control commands. However, the necessity to incorporate historical multi-view…

机器人学 · 计算机科学 2026-03-30 Yiru Wang , Anqing Jiang , Shuo Wang , Yuwen Heng , Zichong Gu , Hao Sun

MMaDA-VLA: Large Diffusion Vision-Language-Action Model with Unified Multi-Modal Instruction and Generation

Vision-Language-Action (VLA) models aim to control robots for manipulation from visual observations and natural-language instructions. However, existing hierarchical and autoregressive paradigms often introduce architectural overhead,…

机器人学 · 计算机科学 2026-03-30 Yang Liu , Pengxiang Ding , Tengyue Jiang , Xudong Wang , Wenxuan Song , Minghui Lin , Han Zhao , Hongyin Zhang , Zifeng Zhuang , Wei Zhao , Siteng Huang , Jinkui Shi , Donglin Wang

Learning Rollout from Sampling:An R1-Style Tokenized Traffic Simulation Model

Learning diverse and high-fidelity traffic simulations from human driving demonstrations is crucial for autonomous driving evaluation. The recent next-token prediction (NTP) paradigm, widely adopted in large language models (LLMs), has been…

机器人学 · 计算机科学 2026-03-30 Ziyan Wang , Peng Chen , Ding Li , Chiwei Li , Qichao Zhang , Zhongpu Xia , Guizhen Yu

SOMA: Strategic Orchestration and Memory-Augmented System for Vision-Language-Action Model Robustness via In-Context Adaptation

Despite the promise of Vision-Language-Action (VLA) models as generalist robotic controllers, their robustness against perceptual noise and environmental variations in out-of-distribution (OOD) tasks remains fundamentally limited by the…

机器人学 · 计算机科学 2026-03-30 Zhuoran Li , Zhiyang Li , Kaijun Zhou , Jinyu Gu

Can a Robot Walk the Robotic Dog: Triple-Zero Collaborative Navigation for Heterogeneous Multi-Agent Systems

We present Triple Zero Path Planning (TZPP), a collaborative framework for heterogeneous multi-robot systems that requires zero training, zero prior knowledge, and zero simulation. TZPP employs a coordinator--explorer architecture: a…

机器人学 · 计算机科学 2026-03-30 Yaxuan Wang , Yifan Xiang , Ke Li , Xun Zhang , BoWen Ye , Zhuochen Fan , Fei Wei , Tong Yang

An Efficient Closed-Form Solution to Full Visual-Inertial State Initialization

In this letter, we present a closed-form initialization method that recovers the full visual-inertial state without nonlinear optimization. Unlike previous approaches that rely on iterative solvers, our formulation yields analytical,…

机器人学 · 计算机科学 2026-03-30 Samuel Cerezo , Seong Hun Lee , Javier Civera

Towards Automated Chicken Deboning via Learning-based Dynamically-Adaptive 6-DoF Multi-Material Cutting

Automating chicken shoulder deboning requires precise 6-DoF cutting through a partially occluded, deformable, multi-material joint, since contact with the bones presents serious health and safety risks. Our work makes both systems-level and…

机器人学 · 计算机科学 2026-03-30 Zhaodong Yang , Ai-Ping Hu , Harish Ravichandar

VG-Mapping: Variation-aware Density Control for Online 3D Gaussian Mapping in Semi-static Scenes

Maintaining an up-to-date map that accurately reflects recent changes in the environment is crucial, especially for robots that repeatedly traverse the same space. Failing to promptly update the changed regions can degrade map quality,…

机器人学 · 计算机科学 2026-03-30 Yicheng He , Jingwen Yu , Guangcheng Chen , Hong Zhang

A Narwhal-Inspired Sensing-to-Control Framework for Small Fixed-Wing Aircraft

Fixed-wing unmanned aerial vehicles (UAVs) offer endurance and efficiency but lack low-speed agility due to highly coupled dynamics. We present an end-to-end sensing-to-control pipeline that combines bio-inspired hardware, physics-informed…

机器人学 · 计算机科学 2026-03-30 Fengze Xie , Xiaozhou Fan , Jacob Schuster , Yisong Yue , Morteza Gharib

HELIOS: Hierarchical Exploration for Language-Grounded Interaction in Open Scenes

Language-specified mobile manipulation tasks in novel environments simultaneously face challenges interacting with a scene which is only partially observed, grounding semantic information from language instructions to the partially observed…

机器人学 · 计算机科学 2026-03-30 Katrina Ashton , Chahyon Ku , Shrey Shah , Saumit Vedula , Tingrui Zhang , Wen Jiang , Kostas Daniilidis , Bernadette Bucher