机器人学 — Scifaro

Optimizing Control-Friendly Trajectories with Self-Supervised Residual Learning

Real-world physics can only be analytically modeled with a certain level of precision for modern intricate robotic systems. As a result, tracking aggressive trajectories accurately could be challenging due to the existence of residual…

机器人学 · 计算机科学 2026-04-16 Kexin Guo , Zihan Yang , Yuhang Liu , Jindou Jia , Xiang Yu

LEO-RobotAgent: A General-purpose Robotic Agent for Language-driven Embodied Operator

We propose LEO-RobotAgent, a general-purpose language-driven intelligent agent framework for robots. Under this framework, LLMs can operate different types of robots to complete unpredictable complex tasks across various scenarios. This…

机器人学 · 计算机科学 2026-04-16 Lihuang Chen , Xiangyu Luo , Jun Meng

Inertial Magnetic SLAM Systems Using Low-Cost Sensors

Spatially inhomogeneous magnetic fields offer a valuable, non-visual information source for positioning. Among systems leveraging this, magnetic field-based simultaneous localization and mapping (SLAM) systems are particularly attractive.…

机器人学 · 计算机科学 2026-04-16 Chuan Huang , Gustaf Hendeby , Isaac Skog

IGen: Scalable Data Generation for Robot Learning from Open-World Images

The rise of generalist robotic policies has created an exponential demand for large-scale training data. However, on-robot data collection is labor-intensive and often limited to specific environments. In contrast, open-world images capture…

机器人学 · 计算机科学 2026-04-16 Chenghao Gu , Haolan Kang , Junchao Lin , Jinghe Wang , Duo Wu , Shuzhao Xie , Fanding Huang , Junchen Ge , Ziyang Gong , Letian Li , Hongying Zheng , Changwei Lv , Zhi Wang

Robust Verification of Controllers under State Uncertainty via Hamilton-Jacobi Reachability Analysis

As perception-based controllers for autonomous systems become increasingly popular in the real world, it is important that we can formally verify their safety and performance despite perceptual uncertainty. Unfortunately, the verification…

机器人学 · 计算机科学 2026-04-16 Albert Lin , Alessandro Pinto , Somil Bansal

RoboTAG: End-to-end Robot Configuration Estimation via Topological Alignment Graph

Estimating robot pose from a monocular RGB image is a challenge in robotics and computer vision. Existing methods typically build networks on top of 2D visual backbones and depend heavily on labeled data for training, which is often scarce…

机器人学 · 计算机科学 2026-04-16 Yifan Liu , Fangneng Zhan , Wanhua Li , Haowen Sun , Katerina Fragkiadaki , Hanspeter Pfister

X-Diffusion: Training Diffusion Policies on Cross-Embodiment Human Demonstrations

Human videos are a scalable source of training data for robot learning. However, humans and robots significantly differ in embodiment, making many human actions infeasible for direct execution on a robot. Still, these demonstrations convey…

机器人学 · 计算机科学 2026-04-16 Maximus A. Pace , Prithwish Dan , Chuanruo Ning , Atiksh Bhardwaj , Audrey Du , Edward W. Duan , Wei-Chiu Ma , Kushal Kedia

Hierarchical DLO Routing with Reinforcement Learning and In-Context Vision-language Models

Long-horizon routing tasks of deformable linear objects (DLOs), such as cables and ropes, are common in industrial assembly lines and everyday life. These tasks are particularly challenging because they require robots to manipulate DLO with…

机器人学 · 计算机科学 2026-04-16 Mingen Li , Houjian Yu , Yixuan Huang , Youngjin Hong , Hantao Ye , Changhyun Choi

HAMLET: Switch your Vision-Language-Action Model into a History-Aware Policy

Inherently, robotic manipulation tasks are history-dependent: leveraging past context could be beneficial. However, most existing Vision-Language-Action models (VLAs) have been designed without considering this aspect, i.e., they rely…

机器人学 · 计算机科学 2026-04-16 Myungkyu Koo , Daewon Choi , Taeyoung Kim , Kyungmin Lee , Changyeon Kim , Younggyo Seo , Jinwoo Shin

GRITS: A Spillage-Aware Guided Diffusion Policy for Robot Food Scooping Tasks

Robotic food scooping is a critical manipulation skill for food preparation and service robots. However, existing robot learning algorithms, especially learn-from-demonstration methods, still struggle to handle diverse and dynamic food…

机器人学 · 计算机科学 2026-04-16 Yen-Ling Tai , Yi-Ru Yang , Kuan-Ting Yu , Yu-Wei Chao , Yi-Ting Chen

FiLM-Nav: Efficient and Generalizable Navigation via VLM Fine-tuning

Enabling robotic assistants to navigate complex environments and locate objects described in free-form language is a critical capability for real-world deployment. While foundation models, particularly Vision-Language Models (VLMs), offer…

机器人学 · 计算机科学 2026-04-16 Naoki Yokoyama , Sehoon Ha

Safe and Nonconservative Contingency Planning for Autonomous Vehicles via Online Learning-Based Reachable Set Barriers

Autonomous vehicles must navigate dynamically uncertain environments while balancing safety and efficiency. This challenge is exacerbated by unpredictable human-driven vehicle (HV) behaviors and perception inaccuracies, necessitating…

机器人学 · 计算机科学 2026-04-16 Rui Yang , Lei Zheng , Shuzhi Sam Ge , Jun Ma

FCBV-Net: Category-Level Robotic Garment Smoothing via Feature-Conditioned Bimanual Value Prediction

Category-level generalization for robotic garment manipulation, such as bimanual smoothing, remains a significant hurdle due to high dimensionality, complex dynamics, and intra-category variations. Current approaches often struggle, either…

机器人学 · 计算机科学 2026-04-16 Mohammed Daba , Jing Qiu

Robust Route Planning for Sidewalk Delivery Robots

Sidewalk delivery robots are a promising solution for last-mile freight distribution. Yet, they operate in dynamic environments characterized by pedestrian flows and potential obstacles, which make travel times highly uncertain and can…

机器人学 · 计算机科学 2026-04-16 Xing Tong , Michele D. Simoni

Behavior Synthesis via Contact-Aware Fisher Information Maximization

Contact dynamics hold immense amounts of information that can improve a robot's ability to characterize and learn about objects in their environment through interactions. However, collecting information-rich contact data is challenging due…

机器人学 · 计算机科学 2026-04-16 Hrishikesh Sathyanarayan , Ian Abraham

Data-Driven Contact-Aware Control Method for Real-Time Deformable Tool Manipulation: A Case Study in the Environmental Swabbing

Deformable Object Manipulation (DOM) remains a critical challenge in robotics due to the complexities of developing suitable model-based control strategies. Deformable Tool Manipulation (DTM) further complicates this task by introducing…

机器人学 · 计算机科学 2026-04-16 Siavash Mahmoudi , Amirreza Davar , Dongyi Wang

DINO-Explorer: Active Underwater Discovery via Ego-Motion Compensated Semantic Predictive Coding

Marine ecosystem degradation necessitates continuous, scientifically selective underwater monitoring. However, most autonomous underwater vehicles (AUVs) operate as passive data loggers, capturing exhaustive video for offline review and…

机器人学 · 计算机科学 2026-04-15 Yuhan Jin , Nayari Marie Lessa , Mariela De Lucas Alvarez , Melvin Laux , Lucas Amparo Barbosa , Frank Kirchner , Rebecca Adam

E2E-Fly: An Integrated Training-to-Deployment System for End-to-End Quadrotor Autonomy

Training and transferring learning-based policies for quadrotors from simulation to reality remains challenging due to inefficient visual rendering, physical modeling inaccuracies, unmodeled sensor discrepancies, and the absence of a…

机器人学 · 计算机科学 2026-04-15 Fangyu Sun , Fanxing Li , Linzuo Zhang , Yu Hu , Renbiao Jin , Shuyu Wu , Wenxian Yu , Danping Zou

Tree Learning: A Multi-Skill Continual Learning Framework for Humanoid Robots

As reinforcement learning for humanoid robots evolves from single-task to multi-skill paradigms, efficiently expanding new skills while avoiding catastrophic forgetting has become a key challenge in embodied intelligence. Existing…

机器人学 · 计算机科学 2026-04-15 Yifei Yan , Linqi Ye

Robotic Manipulation is Vision-to-Geometry Mapping ($f(v) \rightarrow G$): Vision-Geometry Backbones over Language and Video Models

At its core, robotic manipulation is a problem of vision-to-geometry mapping ($f(v) \rightarrow G$). Physical actions are fundamentally defined by geometric properties like 3D positions and spatial relationships. Consequently, we argue that…

机器人学 · 计算机科学 2026-04-15 Zijian Song , Qichang Li , Jiawei Zhou , Zhenlong Yuan , Tianshui Chen , Liang Lin , Guangrun Wang