Robotics - Scifaro

Task Parameter Extrapolation via Learning Inverse Tasks from Forward Demonstrations

Generalizing skill policies to novel conditions remains a key challenge in robot learning. Imitation learning methods, while data-efficient, are largely confined to the training region and consistently fail on input data outside it, leading…

Robotics · Computer Science 2026-03-10 Serdar Bahar, Fatih Dogangun, Matteo Saveriano, Yukie Nagai, Emre Ugur

MEM: Multi-Scale Embodied Memory for Vision Language Action Models

Conventionally, memory in end-to-end robotic learning involves inputting a sequence of past observations into the learned policy. However, in complex multi-stage real-world tasks, the robot's memory must represent past events at multiple…

Robotics · Computer Science 2026-03-10 Marcel Torne, Karl Pertsch, Homer Walke, Kyle Vedder, Suraj Nair, Brian Ichter, Allen Z. Ren, Haohuan Wang, Jiaming Tang, Kyle Stachowicz, Karan Dhabalia, Michael Equi, Quan Vuong, Jost Tobias Springenberg, Sergey Levine, Chelsea Finn, Danny Driess

$\pi$-StepNFT: Wider Space Needs Finer Steps in Online RL for Flow-based VLAs

Flow-based vision-language-action (VLA) models excel in embodied control but suffer from intractable likelihoods during multi-step sampling, hindering online reinforcement learning. We propose $\pi$-StepNFT…

Robotics · Computer Science 2026-03-10 Siting Wang, Xiaofeng Wang, Zheng Zhu, Minnan Pei, Xinyu Cui, Cheng Deng, Jian Zhao, Guan Huang, Haifeng Zhang, Jun Wang

Iterative Closed-Loop Motion Synthesis for Scaling the Capabilities of Humanoid Control

Physics-based humanoid control relies on training with motion datasets that have diverse data distributions. However, the fixed difficulty distribution of datasets limits the performance ceiling of the trained control policies.…

Robotics · Computer Science 2026-03-10 Weisheng Xu, Qiwei Wu, Jiaxi Zhang, Tan Jing, Yangfan Li, Yuetong Fang, Jiaqi Xiong, Kai Wu, Rong Ou, Renjing Xu

OVerSeeC: Open-Vocabulary Costmap Generation from Satellite Images and Natural Language

Aerial imagery provides essential global context for autonomous navigation, enabling route planning at scales inaccessible to onboard sensing. We address the problem of generating global costmaps for long-range planning directly from…

Robotics · Computer Science 2026-03-10 Rwik Rana, Jesse Quattrociocchi, Dongmyeong Lee, Christian Ellis, Amanda Adkins, Adam Uccello, Garrett Warnell, Joydeep Biswas

Graph Neural Model Predictive Control for High-Dimensional Systems

The control of high-dimensional systems, such as soft robots, requires models that faithfully capture complex dynamics while remaining computationally tractable. This work presents a framework that integrates Graph Neural Network…

Robotics · Computer Science 2026-03-10 Patrick Benito Eberhard, Luis Pabon, Daniele Gammelli, Hugo Buurmeijer, Amon Lahr, Mark Leone, Andrea Carron, Marco Pavone

Accelerating Robotic Reinforcement Learning with Agent Guidance

Reinforcement Learning (RL) offers a powerful paradigm for autonomous robots to master generalist manipulation skills through trial-and-error. However, its real-world application is stifled by low sample efficiency. Recent Human-in-the-Loop…

Robotics · Computer Science 2026-03-10 Haojun Chen, Zili Zou, Chengdong Ma, Yaoxiang Pu, Haotong Zhang, Yuanpei Chen, Yaodong Yang

Task-Oriented Robot-Human Handovers on Legged Manipulators

Task-oriented handovers (TOH) are fundamental to effective human-robot collaboration, requiring robots to present objects in a way that supports the human's intended post-handover use. Existing approaches are typically based on object- or…

Robotics · Computer Science 2026-03-10 Andreea Tulbure, Carmen Scheidemann, Elias Steiner, Marco Hutter

Synchronized Online Friction Estimation and Adaptive Grasp Control for Robust Gentle Grasp

We introduce a unified framework for gentle robotic grasping that synergistically couples real-time friction estimation with adaptive grasp control. We propose a new particle filter-based method for real-time estimation of the friction…

Robotics · Computer Science 2026-03-10 Zhenwei Niu, Xiaoyi Chen, Jiayu Hu, Zhaoyang Liu, Tang Jian, Xiaozu Ju

AgenticLab: A Real-World Robot Agent Platform that Can See, Think, and Act

Recent advances in large vision-language models (VLMs) have demonstrated generalizable open-vocabulary perception and reasoning, yet their real-robot manipulation capability remains unclear for long-horizon, closed-loop execution in…

Robotics · Computer Science 2026-03-10 Pengyuan Guo, Zhonghao Mai, Zhengtong Xu, Kaidi Zhang, Heng Zhang, Zichen Miao, Arash Ajoudani, Zachary Kingston, Qiang Qiu, Yu She

Green-VLA: Staged Vision-Language-Action Model for Generalist Robots

We introduce Green-VLA, a staged Vision-Language-Action (VLA) framework for real-world deployment on the Green humanoid robot while maintaining generalization across diverse embodiments. Green-VLA follows a five-stage curriculum: (L0)…

Robotics · Computer Science 2026-03-10 I. Apanasevich, M. Artemyev, R. Babakyan, P. Fedotova, D. Grankin, E. Kupryashin, A. Misailidi, D. Nerus, A. Nutalapati, G. Sidorov, I. Efremov, M. Gerasyov, D. Pikurov, Y. Senchenko, S. Davidenko, D. Kulikov, M. Sultankin, K. Askarbek, O. Shamanin, D. Statovoy, E. Zalyaev, I. Zorin, A. Letkin, E. Rusakov, A. Silchenko, V. Vorobyov, S. Sobolnikov, A. Postnikov

BEV-Patch-PF: Particle Filtering with BEV-Aerial Feature Matching for Off-Road Geo-Localization

We propose BEV-Patch-PF, a GPS-free sequential geo-localization system that integrates a particle filter with learned bird's-eye-view (BEV) and aerial feature maps. From onboard RGB and depth images, we construct a BEV feature map. For each…

Robotics · Computer Science 2026-03-10 Dongmyeong Lee, Jesse Quattrociocchi, Christian Ellis, Rwik Rana, Amanda Adkins, Adam Uccello, Garrett Warnell, Joydeep Biswas

Multi-directional Safe Rectangle Corridor-Based MPC for Nonholonomic Robots Navigation in Cluttered Environment

Autonomous Mobile Robots (AMRs) have become indispensable in industrial applications due to their operational flexibility and efficiency. Navigation serves as a crucial technical foundation for accomplishing complex tasks. However,…

Robotics · Computer Science 2026-03-10 Yinsong Qu, Yunxiang Li, Shanlin Zhong

IPPO Learns the Game, Not the Team: A Study on Generalization in Heterogeneous Agent Teams

Multi-Agent Reinforcement Learning (MARL) is commonly deployed in settings where agents are trained via self-play with homogeneous teammates, often using parameter sharing and a single policy architecture. This opens the question: to what…

Robotics · Computer Science 2026-03-10 Ryan LeRoy, Jack Kolb

Stable Multi-Drone GNSS Tracking System for Marine Robots

Stable and accurate tracking is essential for marine robotics, yet Global Navigation Satellite System (GNSS) signals vanish immediately below the sea surface. Traditional alternatives suffer from error accumulation, high computational…

Robotics · Computer Science 2026-03-10 Shuo Wen, Edwin Meriaux, Mariana Sosa Guzmán, Zhizun Wang, Junming Shi, Gregory Dudek

MobiDock: Design and Control of A Modular Self Reconfigurable Bimanual Mobile Manipulator via Robotic Docking

Multi-robot systems, particularly mobile manipulators, face challenges in control coordination and dynamic stability when working together. To address this issue, this study proposes MobiDock, a modular self-reconfigurable mobile…

Robotics · Computer Science 2026-03-10 Xuan-Thuan Nguyen, Khac Nam Nguyen, Ngoc Duy Tran, Thi Thoa Mac, Anh Nguyen, Hoang Hiep Ly, Tung D. Ta

NaviTrace: Evaluating Embodied Navigation of Vision-Language Models

Vision-language models demonstrate unprecedented performance and generalization across a wide range of tasks and scenarios. Integrating these foundation models into robotic navigation systems opens pathways toward building general-purpose…

Robotics · Computer Science 2026-03-10 Tim Windecker, Manthan Patel, Moritz Reuss, Richard Schwarzkopf, Cesar Cadena, Rudolf Lioutikov, Marco Hutter, Jonas Frey

LagMemo: Language 3D Gaussian Splatting Memory for Multi-modal Open-vocabulary Multi-goal Visual Navigation

Navigating to a designated goal using visual information is a fundamental capability for intelligent robots. To address the practical demands of multi-modal, open-vocabulary goal queries and multi-goal visual navigation, we propose LagMemo,…

Robotics · Computer Science 2026-03-10 Haotian Zhou, Xiaole Wang, He Li, Zhuo Qi, Jinrun Yin, Haiyu Kong, Jianghuan Xu, Huijing Zhao

Automated Pest Counting in Water Traps through Active Robotic Stirring for Occlusion Handling

Existing image-based pest counting methods rely on single static images and often produce inaccurate results under occlusion. To address this issue, this paper proposes an automated pest counting method in water traps through active robotic…

Robotics · Computer Science 2026-03-10 Xumin Gao, Mark Stevens, Grzegorz Cielniak

HumanHalo - Safe and Efficient 3D Navigation Among Humans via Minimally Conservative MPC

Safe and efficient robotic navigation among humans is essential for integrating robots into everyday environments. Most existing approaches focus on simplified 2D crowd navigation and fail to account for the full complexity of human body…

Robotics · Computer Science 2026-03-10 Simon Schaefer, Helen Oleynikova, Sandra Hirche, Stefan Leutenegger