Robotics - Scifaro

An Aerial Manipulator for Perception-Driven Flower Targeting Toward Contactless Pollination in Vertical Farming

The decline of natural pollinators has created a major challenge for crop production in controlled indoor agriculture, particularly in vertical farming environments where natural insect pollination is absent. This motivates the development…

Robotics · Computer Science 2026-05-11 Chenzhe Jin, Zhuohang Wu, Yifan Cai, Xiangqi Li, Jan Ming Kevin Tan, Narsimlu Kemsaram, Valerio Modugno

VLA-GSE: Boosting Parameter-Efficient Fine-Tuning in VLA with Generalized and Specialized Experts

Vision-language-action (VLA) models inherit rich visual-semantic priors from pre-trained vision-language backbones, but adapting them to robotic control remains challenging. Full fine-tuning (FFT) is prone to overfitting on downstream…

Robotics · Computer Science 2026-05-11 Yuhua Jiang, Junjie Lu, Xinyao Qin, Xiaoyu Chen, Kaixin Wang, Feifei Gao, Li Zhao

LineRides: Line-Guided Reinforcement Learning for Bicycle Robot Stunts

Designing reward functions for agile robotic maneuvers in reinforcement learning remains difficult, and demonstration-based approaches often require reference motions that are unavailable for novel platforms or extreme stunts. We present…

Robotics · Computer Science 2026-05-11 Seungeun Rho, Shamel Fahmi, Jeonghwan Kim, Arianna Ilvonen, Sehoon Ha, Gabriel Nelson

MolmoAct2: Action Reasoning Models for Real-world Deployment

Vision-Language-Action (VLA) models aim to provide a single generalist controller for robots, but today's systems fall short on the criteria that matter for real-world deployment. Frontier models are closed, open-weight alternatives are…

Robotics · Computer Science 2026-05-11 Haoquan Fang, Jiafei Duan, Donovan Clay, Sam Wang, Shuo Liu, Weikai Huang, Xiang Fan, Wei-Chuan Tsai, Shirui Chen, Yi Ru Wang, Shanli Xing, Jaemin Cho, Jae Sung Park, Ainaz Eftekhar, Peter Sushko, Karen Farley, Angad Wadhwa, Cole Harrison, Winson Han, Ying-Chun Lee, Eli VanderBilt, Rose Hendrix, Suveen Ellawela, Lucas Ngoo, Joyce Chai, Zhongzheng Ren, Ali Farhadi, Dieter Fox, Ranjay Krishna

TAIL-Safe: Task-Agnostic Safety Monitoring for Imitation Learning Policies

Recent imitation learning (IL) algorithms such as flow-matching and diffusion policies demonstrate remarkable performance in learning complex manipulation tasks. However, these policies often fail even when operating within their training…

Robotics · Computer Science 2026-05-11 Riad Ahmed, Momotaz Begum

Affordance Agent Harness: Verification-Gated Skill Orchestration

Affordance grounding requires identifying where and how an agent should interact in open-world scenes, where actionable regions are often small, occluded, reflective, and visually ambiguous. Recent systems therefore combine multiple skills…

Robotics · Computer Science 2026-05-11 Haojian Huang, Jiahao Shi, Yinchuan Li, Yingcong Chen

3D Generation for Embodied AI and Robotic Simulation: A Survey

Embodied AI and robotic systems increasingly depend on scalable, diverse, and physically grounded 3D content for simulation-based training and real-world deployment. While 3D generative modeling has advanced rapidly, embodied applications…

Robotics · Computer Science 2026-05-11 Tianwei Ye, Yifan Mao, Minwen Liao, Jian Liu, Chunchao Guo, Dazhao Du, Quanxin Shou, Fangqi Zhu, Song Guo

Agent-Centric Observation Adaptation for Robust Visual Control under Dynamic Perturbations

Real-world visual systems face time-varying perturbations, including weather, sensor noise, compression artifacts, and background distractions. Existing image restoration methods are typically designed for fixed corruption types and…

Robotics · Computer Science 2026-05-11 Zhengru Fang, Yu Guo, Fei Liu, Yuang Zhang, Yihang Tao, Senkang Hu, Wenbo Ding, Yuguang Fang

Task-Adaptive Admittance Control for Human-Quadrotor Cooperative Load Transportation with Dynamic Cable-Length Regulation

The collaboration between humans and robots is critical in many robotic applications, especially in those requiring physical human-robot interaction (pHRI). Previous research in pHRI has largely focused on robotic manipulators, employing…

Robotics · Computer Science 2026-05-11 Shuai Li, Ton T. H. Duong, Damiano Zanotto

GustPilot: A Hierarchical DRL-INDI Framework for Wind-Resilient Quadrotor Navigation

Wind disturbances remain a key barrier to reliable autonomous navigation for lightweight quadrotors, where the rapidly varying airflow can destabilize both planning and tracking. This paper introduces GustPilot, a hierarchical…

Robotics · Computer Science 2026-05-11 Amir Atef Habel, Roohan Ahmed Khan, Fawad Mehboob, Clement Fortin, Dzmitry Tsetserukou

Dynamic Properties and Motion Reproducibility of a Compact Pneumatically Actuated Humanoid Upper Body for Data-Driven Control

Pneumatically actuated anthropomorphic robots with high degrees of freedom (DOF) offer significant potential for physical human-robot interaction. However, precise control of pneumatic actuators is challenging due to their inherent…

Robotics · Computer Science 2026-05-11 Hiroshi Atsuta, Hisashi Ishihara, Minoru Asada

Contact-Grounded Policy: Dexterous Visuotactile Policy with Generative Contact Grounding

Contact-rich dexterous manipulation with multi-finger hands remains an open challenge in robotics because task success depends on multi-point contacts that continuously evolve and are highly sensitive to object geometry, frictional…

Robotics · Computer Science 2026-05-11 Zhengtong Xu, Yeping Wang, Ben Abbatematteo, Jom Preechayasomboon, Sonny Chan, Nick Colonnese, Amirhossein H. Memar

SeedPolicy: Horizon Scaling via Self-Evolving Diffusion Policy for Robot Manipulation

Imitation Learning (IL) enables robots to acquire manipulation skills from expert demonstrations. Diffusion Policy (DP) models multi-modal expert behaviors but degrades when naively increasing stacked observation horizons, limiting…

Robotics · Computer Science 2026-05-11 Youqiang Gui, Yuxuan Zhou, Shen Cheng, Xinyang Yuan, Haoqiang Fan, Peng Cheng, Shuaicheng Liu

A Cost-Effective and Climate-Resilient Air Pressure System for Rain Effect Reduction on Automated Vehicle Cameras

Recent advances in automated vehicles have focused on improving perception performance under adverse weather conditions; however, research on physical hardware solutions remains limited, despite their importance for perception-critical…

Robotics · Computer Science 2026-05-11 Mohamed Sabry, Joseba Gorospe, Cristina Olaverri-Monreal

Docking and Persistent Operations for a Resident Underwater Vehicle

Our understanding of the oceans remains limited by sparse and infrequent observations, primarily because current methods are constrained by the high cost and logistical effort of underwater monitoring, relying either on sporadic surveys…

Robotics · Computer Science 2026-05-11 Leonard Günzel, Gabrielė Kasparavičiūtė, Ambjørn Grimsrud Waldum, Bjørn-Magnus Moslått, Abubakar Aliyu Badawi, Celil Yılmaz, Md Shamin Yeasher Yousha, Robert Staven, Martin Ludvigsen

HAIC: Humanoid Agile Object Interaction Control via Dynamics-Aware World Model

Humanoid robots show promise for complex whole-body tasks in unstructured environments. Although Human-Object Interaction (HOI) has advanced, most methods focus on fully actuated objects rigidly coupled to the robot, ignoring underactuated…

Robotics · Computer Science 2026-05-11 Dongting Li, Xingyu Chen, Qianyang Wu, Bo Chen, Sikai Wu, Hanyu Wu, Guoyao Zhang, Liang Li, Mingliang Zhou, Diyun Xiang, Jianzhu Ma, Qiang Zhang, Renjing Xu

DynaRetarget: Dynamically-Feasible Retargeting using Sampling-Based Trajectory Optimization

In this paper, we introduce DynaRetarget, a complete pipeline for retargeting human motions to humanoid control policies. The core component of DynaRetarget is a novel Sampling-Based Trajectory Optimization (SBTO) framework that refines…

Robotics · Computer Science 2026-05-11 Victor Dhedin, Ilyass Taouil, Shafeef Omar, Dian Yu, Kun Tao, Angela Dai, Majid Khadiv

Latent Reasoning VLA: Latent Thinking and Prediction for Vision-Language-Action Models

Vision-Language-Action (VLA) models benefit from chain-of-thought (CoT) reasoning, but existing approaches incur high inference overhead and rely on discrete reasoning representations that mismatch continuous perception and control. We…

Robotics · Computer Science 2026-05-11 Shuanghao Bai, Jing Lyu, Wanqi Zhou, Zhe Li, Dakai Wang, Lei Xing, Xiaoguang Zhao, Pengwei Wang, Zhongyuan Wang, Cheng Chi, Badong Chen, Shanghang Zhang

DisCo-FLoc: Semantic-Free Floorplan Localization via $SE(2)$-Aware Contrastive Disambiguation

Visual Floorplan Localization (FLoc) struggles with severe structural aliasing caused by repetitive minimalist layouts. This occurs because physically distant poses share highly similar visual-geometric features, which degrades spatial…

Robotics · Computer Science 2026-05-11 Ping Zhong, Shiyong Meng, Bolei Chen, Tao Zou, Chaoxu Mu, Jianxin Wang

Large Video Planner Enables Generalizable Robot Control

General-purpose robots require decision-making models that generalize across diverse tasks and environments. Recent works build robot foundation models by extending multimodal large language models (MLLMs) with action outputs, creating…

Robotics · Computer Science 2026-05-11 Boyuan Chen, Tianyuan Zhang, Haoran Geng, Caiyi Zhang, Peihao Li, Kiwhan Song, William T. Freeman, Jitendra Malik, Pieter Abbeel, Russ Tedrake, Vincent Sitzmann, Yilun Du