机器人学 — Scifaro

FD-VLA: Force-Distilled Vision-Language-Action Model for Contact-Rich Manipulation

Force sensing is a crucial modality for Vision-Language-Action (VLA) frameworks, as it enables fine-grained perception and dexterous manipulation in contact-rich tasks. We present Force-Distilled VLA (FD-VLA), a novel framework that…

机器人学 · 计算机科学 2026-03-23 Ruiteng Zhao , Wenshuo Wang , Yicheng Ma , Xiaocong Li , Francis E. H. Tay , Marcelo H. Ang , Haiyue Zhu

Safe Path Planning and Observation Quality Enhancement Strategy for Unmanned Aerial Vehicles in Water Quality Monitoring Tasks

Unmanned Aerial Vehicle (UAV) spectral remote sensing technology is widely used in water quality monitoring. However, in dynamic environments, varying illumination conditions, such as shadows and specular reflection (sun glint), can cause…

机器人学 · 计算机科学 2026-03-23 Yuanshuang Fu , Qianyao Wang , Qihao Wang , Bonan Zhang , Jiaxin Zhao , Yiming Cao , Zhijun Li

FORWARD: Dataset of a forwarder operating in rough terrain

We present FORWARD, a high-resolution multimodal dataset of a cut-to-length forwarder operating in rough terrain on two harvest sites in the middle part of Sweden. The forwarder is a large Komatsu model equipped with vehicle telematics…

机器人学 · 计算机科学 2026-03-23 Mikael Lundbäck , Erik Wallin , Carola Häggström , Mattias Nyström , Andreas Grönlund , Mats Richardson , Petrus Jönsson , William Arnvik , Lucas Hedström , Arvid Fälldin , Martin Servin

RobotArena $\infty$: Scalable Robot Benchmarking via Real-to-Sim Translation

The pursuit of robot generalists, agents capable of performing diverse tasks across diverse environments, demands rigorous and scalable evaluation. Yet real-world testing of robot policies remains fundamentally constrained: it is…

机器人学 · 计算机科学 2026-03-23 Yash Jangir , Yidi Zhang , Pang-Chi Lo , Kashu Yamazaki , Chenyu Zhang , Kuan-Hsun Tu , Tsung-Wei Ke , Lei Ke , Yonatan Bisk , Katerina Fragkiadaki

SpikeGrasp: A Benchmark for 6-DoF Grasp Pose Detection from Stereo Spike Streams

Most robotic grasping systems rely on converting sensor data into explicit 3D point clouds, which is a computational step not found in biological intelligence. This paper explores a fundamentally different, neuro-inspired paradigm for 6-DoF…

机器人学 · 计算机科学 2026-03-23 Zhuoheng Gao , Jiyao Zhang , Zhiyong Xie , Hao Dong , Zhaofei Yu , Rongmei Chen , Guozhang Chen , Tiejun Huang

Mash, Spread, Slice! Learning to Manipulate Object States via Visual Spatial Progress

Most robot manipulation focuses on changing the kinematic state of objects: picking, placing, opening, or rotating them. However, a wide range of real-world manipulation tasks involve a different class of object state change--such as…

机器人学 · 计算机科学 2026-03-23 Priyanka Mandikal , Jiaheng Hu , Shivin Dass , Sagnik Majumder , Roberto Martín-Martín , Kristen Grauman

Uncertainty-Aware Multi-Robot Task Allocation With Strongly Coupled Inter-Robot Rewards

Allocating tasks to heterogeneous robot teams in environments with uncertain task requirements is a fundamentally challenging problem. Redundantly assigning multiple robots to such tasks is overly conservative, while purely reactive…

机器人学 · 计算机科学 2026-03-23 Ben Rossano , Jaein Lim , Jonathan P. How

World4RL: Diffusion World Models for Policy Refinement with Reinforcement Learning for Robotic Manipulation

Robotic manipulation policies are commonly initialized through imitation learning, but their performance is limited by the scarcity and narrow coverage of expert data. Reinforcement learning can refine polices to alleviate this limitation,…

机器人学 · 计算机科学 2026-03-23 Zhennan Jiang , Kai Liu , Yuxin Qin , Shuai Tian , Yupeng Zheng , Mingcai Zhou , Chao Yu , Haoran Li , Dongbin Zhao

Learning Discrete Abstractions for Visual Rearrangement Tasks Using Vision-Guided Graph Coloring

Learning abstractions directly from data is a core challenge in robotics. Humans naturally operate at an abstract level, reasoning over high-level subgoals while delegating execution to low-level motor skills -- an ability that enables…

机器人学 · 计算机科学 2026-03-23 Abhiroop Ajith , Constantinos Chamzas

Adaptive Relative Pose Estimation Framework with Dual Noise Tuning for Safe Approaching Maneuvers

Accurate and robust relative pose estimation is crucial for enabling challenging Active Debris Removal (ADR) missions targeting tumbling derelict satellites such as ESA's ENVISAT. This work presents a complete pipeline integrating advanced…

机器人学 · 计算机科学 2026-03-23 Batu Candan , Murat Berke Oktay , Simone Servadio

CoInfra: A Large-Scale Cooperative Infrastructure Perception System and Dataset for Vehicle-Infrastructure Cooperation in Adverse Weather

Vehicle-infrastructure (V2I) cooperative perception can substantially extend the range, coverage, and robustness of autonomous driving systems beyond the limits of onboard-only sensing, particularly in occluded and adverse-weather…

机器人学 · 计算机科学 2026-03-23 Minghao Ning , Yufeng Yang , Keqi Shu , Shucheng Huang , Jiaming Zhong , Maryam Salehi , Mahdi Rahmani , Jiaming Guo , Yukun Lu , Chen Sun , Aladdin Saleh , Ehsan Hashemi , Amir Khajepour

Latent Action Diffusion for Cross-Embodiment Manipulation

End-to-end learning is emerging as a powerful paradigm for robotic manipulation, but its effectiveness is limited by data scarcity and the heterogeneity of action spaces across robot embodiments. In particular, diverse action spaces across…

机器人学 · 计算机科学 2026-03-23 Erik Bauer , Elvis Nava , Robert K. Katzschmann

Pseudo-Simulation for Autonomous Driving

Existing evaluation paradigms for Autonomous Vehicles (AVs) face critical limitations. Real-world evaluation is often challenging due to safety concerns and a lack of reproducibility, whereas closed-loop simulation can face insufficient…

机器人学 · 计算机科学 2026-03-23 Wei Cao , Marcel Hallgarten , Tianyu Li , Daniel Dauner , Xunjiang Gu , Caojun Wang , Yakov Miron , Marco Aiello , Hongyang Li , Igor Gilitschenski , Boris Ivanovic , Marco Pavone , Andreas Geiger , Kashyap Chitta

Spectral Normalization for Lipschitz-Constrained Policies on Learning Humanoid Locomotion

Reinforcement learning (RL) has shown great potential in training agile and adaptable controllers for legged robots, enabling them to learn complex locomotion behaviors directly from experience. However, policies trained in simulation often…

机器人学 · 计算机科学 2026-03-23 Jaeyong Shin , Woohyun Cha , Donghyeon Kim , Junhyeok Cha , Jaeheung Park

RL-based Control of UAS Subject to Significant Disturbance

This paper proposes a Reinforcement Learning (RL)-based control framework for position and attitude control of an Unmanned Aerial System (UAS) subjected to significant disturbance that can be associated with an uncertain trigger signal. The…

机器人学 · 计算机科学 2026-03-23 Kousheek Chakraborty , Thijs Hof , Ayham Alharbat , Abeje Mersha

From Vocal Instructions to Household Tasks: The Inria TIAGo++ in the euROBIN Service Robots Coopetition

This paper describes the Inria team's integrated robotics system used in the 1st euROBIN coopetition, during which service robots performed voice-activated household tasks in a kitchen setting. The team developed a modified TIAGo++ platform…

机器人学 · 计算机科学 2026-03-23 Fabio Amadio , Clemente Donoso , Dionis Totsila , Raphael Lorenzo , Quentin Rouxel , Olivier Rochel , Enrico Mingo Hoffman , Jean-Baptiste Mouret , Serena Ivaldi

EgoSpot:Egocentric Multimodal Control for Hands-Free Mobile Manipulation

We propose a novel hands-free control framework for the Boston Dynamics Spot robot using the Microsoft HoloLens 2 mixed-reality headset. Enabling accessible robot control is critical for allowing individuals with physical disabilities to…

机器人学 · 计算机科学 2026-03-23 Ganlin Zhang , Deheng Zhang , Longteng Duan , Guo Han , Yuqian Fu , Danda Pani Paudel , Luc Van Gool , Eric Vollenweider

Not All Features Are Created Equal: A Mechanistic Study of Vision-Language-Action Models

Vision-Language-Action (VLA) models combine perception, language, and motor control in a single architecture, yet how they translate multimodal inputs into actions remains poorly understood. We apply activation injection, sparse…

机器人学 · 计算机科学 2026-03-20 Bryce Grant , Xijia Zhao , Peng Wang

NavTrust: Benchmarking Trustworthiness for Embodied Navigation

There are two major categories of embodied navigation: Vision-Language Navigation (VLN), where agents navigate by following natural language instructions; and Object-Goal Navigation (OGN), where agents navigate to a specified target object.…

机器人学 · 计算机科学 2026-03-20 Huaide Jiang , Yash Chaudhary , Yuping Wang , Zehao Wang , Raghav Sharma , Manan Mehta , Yang Zhou , Lichao Sun , Zhiwen Fan , Zhengzhong Tu , Jiachen Li

Sparse Autoencoders Reveal Interpretable and Steerable Features in VLA Models

Vision-Language-Action (VLA) models have emerged as a promising approach for general-purpose robot manipulation. However, their generalization is inconsistent: while these models can perform impressively in some settings, fine-tuned…

机器人学 · 计算机科学 2026-03-20 Aiden Swann , Lachlain McGranahan , Hugo Buurmeijer , Monroe Kennedy , Mac Schwager