Robotics - Scifaro

A Hybrid Neural-Assisted Unscented Kalman Filter for Unmanned Ground Vehicle Navigation

Modern autonomous navigation for unmanned ground vehicles relies on different estimators to fuse inertial sensors and GNSS measurements. However, the constant noise covariance matrices often struggle to account for dynamic real-world…

Robotics · Computer Science 2026-03-26 Gal Versano, Itzik Klein

Xiaomi-Robotics-0: An Open-Sourced Vision-Language-Action Model with Real-Time Execution

In this report, we introduce Xiaomi-Robotics-0, an advanced vision-language-action (VLA) model optimized for high performance and fast and smooth real-time execution. The key to our method lies in a carefully designed training recipe and…

Robotics · Computer Science 2026-03-26 Rui Cai, Jun Guo, Xinze He, Piaopiao Jin, Jie Li, Bingxuan Lin, Futeng Liu, Wei Liu, Fei Ma, Kun Ma, Feng Qiu, Heng Qu, Yifei Su, Qiao Sun, Dong Wang, Donghao Wang, Yunhong Wang, Rujie Wu, Diyun Xiang, Yu Yang, Hangjun Ye, Yuan Zhang, Quanyun Zhou

Point Bridge: 3D Representations for Cross Domain Policy Learning

Robot foundation models are beginning to deliver on the promise of generalist robotic agents, yet progress remains constrained by the scarcity of large-scale real-world manipulation datasets. Simulation and synthetic data generation offer a…

Robotics · Computer Science 2026-03-26 Siddhant Haldar, Lars Johannsmeier, Lerrel Pinto, Abhishek Gupta, Dieter Fox, Yashraj Narang, Ajay Mandlekar

E0: Enhancing Generalization and Fine-Grained Control in VLA Models via Tweedie Discrete Diffusion

Vision-Language-Action (VLA) models offer a unified framework for robotic manipulation by integrating visual perception, language understanding, and control generation. However, existing VLA systems still struggle to generalize across…

Robotics · Computer Science 2026-03-26 Zhihao Zhan, Jiaying Zhou, Likui Zhang, Qinhan Lv, Hao Liu, Jusheng Zhang, Weizheng Li, Ziliang Chen, Tianshui Chen, Ruifeng Zhai, Keze Wang, Liang Lin, Guangrun Wang

Unified Diffusion VLA: Vision-Language-Action Model via Joint Discrete Denoising Diffusion Process

Vision-language-action (VLA) models aim to understand natural language instructions and visual observations and to execute corresponding actions as an embodied agent. Recent work integrates future images into the understanding-acting loop,…

Robotics · Computer Science 2026-03-26 Jiayi Chen, Wenxuan Song, Pengxiang Ding, Ziyang Zhou, Han Zhao, Feilong Tang, Donglin Wang, Haoang Li

ACG: Action Coherence Guidance for Flow-based Vision-Language-Action models

Diffusion and flow matching models have emerged as powerful robot policies, enabling Vision-Language-Action (VLA) models to generalize across diverse scenes and instructions. Yet, when trained via imitation learning, their high generative…

Robotics · Computer Science 2026-03-26 Minho Park, Kinam Kim, Junha Hyung, Hyojin Jang, Hoiyeong Jin, Jooyeol Yun, Hojoon Lee, Jaegul Choo

Autonomous Legged Mobile Manipulation for Lunar Surface Operations via Constrained Reinforcement Learning

Robotics plays a pivotal role in planetary science and exploration, where autonomous and reliable systems are crucial due to the risks and challenges inherent to space environments. The establishment of permanent lunar bases demands robotic…

Robotics · Computer Science 2026-03-26 Alvaro Belmonte-Baeza, Miguel Cazorla, Gabriel J. García, Carlos J. Pérez-Del-Pulgar, Jorge Pomares

Rotor-Failure-Aware Quadrotors Flight in Unknown Environments

Rotor failures in quadrotors may result in high-speed rotation and vibration due to rotor imbalance, which introduces significant challenges for autonomous flight in unknown environments. The mainstream approaches against rotor failures…

Robotics · Computer Science 2026-03-26 Xiaobin Zhou, Miao Wang, Chengao Li, Can Cui, Ruibin Zhang, Yongchao Wang, Chao Xu, Fei Gao

MiniBEE: A New Form Factor for Compact Bimanual Dexterity

Bimanual robot manipulators can achieve impressive dexterity, but typically rely on two full six- or seven-degree-of-freedom arms so that paired grippers can coordinate effectively. This traditional framework increases system complexity…

Robotics · Computer Science 2026-03-26 Sharfin Islam, Zewen Chen, Zhanpeng He, Swapneel Bhatt, Andres Permuy, Brock Taylor, James Vickery, Zhengbin Lu, Cheng Zhang, Pedro Piacenza, Matei Ciocarlie

Memory-Augmented Potential Field Theory: A Framework for Adaptive Control in Non-Convex Domains

Stochastic optimal control methods often struggle in complex non-convex landscapes, frequently becoming trapped in local optima due to their inability to learn from historical trajectory data. This paper introduces Memory-Augmented…

Robotics · Computer Science 2026-03-26 Dongzhe Zheng, Wenjie Mei

Reward Evolution with Graph-of-Thoughts: A Bi-Level Language Model Framework for Reinforcement Learning

Designing effective reward functions remains a major challenge in reinforcement learning (RL), often requiring considerable human expertise and iterative refinement. Recent advances leverage Large Language Models (LLMs) for automated reward…

Robotics · Computer Science 2026-03-26 Changwei Yao, Xinzi Liu, Chen Li, Marios Savvides

Learning collision risk proactively from naturalistic driving data at scale

Accurately and proactively alerting drivers or automated systems to emerging collisions is crucial for road safety, particularly in highly interactive and complex urban environments. Existing methods either require labour-intensive…

Robotics · Computer Science 2026-03-26 Yiru Jiao, Simeon C. Calvert, Sander van Cranenburgh, Hans van Lint

Sim-to-Real of Humanoid Locomotion Policies via Joint Torque Space Perturbation Injection

This paper proposes a novel alternative to existing sim-to-real methods for training control policies with simulated experiences. Prior sim-to-real methods for legged robots mostly rely on the domain randomization approach, where a fixed…

Robotics · Computer Science 2026-03-26 Junhyeok Rui Cha, Woohyun Cha, Jaeyong Shin, Donghyeon Kim, Jaeheung Park

KINESIS: Motion Imitation for Human Musculoskeletal Locomotion

How do humans move? Advances in reinforcement learning (RL) have produced impressive results in capturing human motion using physics-based humanoid control. However, torque-controlled humanoids fail to model key aspects of human motor…

Robotics · Computer Science 2026-03-26 Merkourios Simos, Alberto Silvio Chiappa, Alexander Mathis

Dynamic Neural Potential Field: Online Trajectory Optimization in the Presence of Moving Obstacles

Generalist robot policies must operate safely and reliably in everyday human environments such as homes, offices, and warehouses, where people and objects move unpredictably. We present Dynamic Neural Potential Field (NPField-GPT), a…

Robotics · Computer Science 2026-03-26 Aleksei Staroverov, Muhammad Alhaddad, Aditya Narendra, Konstantin Mironov, Aleksandr Panov

DIDLM: A SLAM Dataset for Difficult Scenarios Featuring Infrared, Depth Cameras, LIDAR, 4D Radar, and Others under Adverse Weather, Low Light Conditions, and Rough Roads

Adverse weather conditions, low-light environments, and bumpy road surfaces pose significant challenges to SLAM in robotic navigation and autonomous driving. Existing datasets in this field predominantly rely on single sensors or…

Robotics · Computer Science 2026-03-26 Weisheng Gong, Chen He, Kaijie Su, Qingyong Li, Tong Wu, Z. Jane Wang

VTAM: Video-Tactile-Action Models for Complex Physical Interaction Beyond VLAs

Video-Action Models (VAMs) have emerged as a promising framework for embodied intelligence, learning implicit world dynamics from raw video streams to produce temporally consistent action predictions. Although such models demonstrate strong…

Robotics · Computer Science 2026-03-25 Haoran Yuan, Weigang Yi, Zhenyu Zhang, Wendi Chen, Yuchen Mo, Jiashi Yin, Xinzhuo Li, Xiangyu Zeng, Chuan Wen, Cewu Lu, Katherine Driggs-Campbell, Ismini Lourentzou

Rectify, Don't Regret: Avoiding Pitfalls of Differentiable Simulation in Trajectory Prediction

Current open-loop trajectory models struggle in real-world autonomous driving because minor initial deviations often cascade into compounding errors, pushing the agent into out-of-distribution states. While fully differentiable closed-loop…

Robotics · Computer Science 2026-03-25 Harsh Yadav, Christian Bohn, Tobias Meisen

PinPoint: Monocular Needle Pose Estimation for Robotic Suturing via Stein Variational Newton and Geometric Residuals

Reliable estimation of surgical needle 3D position and orientation is essential for autonomous robotic suturing, yet existing methods operate almost exclusively under stereoscopic vision. In monocular endoscopic settings, common in…

Robotics · Computer Science 2026-03-25 Jesse F. d'Almeida, Tanner Watts, Susheela Sharma Stern, James Ferguson, Alan Kuntz, Robert J. Webster

Edge Radar Material Classification Under Geometry Shifts

Material awareness can improve robotic navigation and interaction, particularly in conditions where cameras and LiDAR degrade. We present a lightweight mmWave radar material classification pipeline designed for ultra-low-power edge devices…

Robotics · Computer Science 2026-03-25 Jannik Hohmann, Dong Wang, Andreas Nüchter