机器人学 — Scifaro

From Fold to Function: Simulation-Driven Design of Origami Mechanisms

Origami-inspired mechanisms can transform flat sheets into functional three-dimensional dynamic structures that are lightweight, compact, and capable of complex motion. These properties make origami increasingly valuable in robotic and…

机器人学 · 计算机科学 2026-05-05 Tianhui Han , Shashwat Singh , Sarvesh Patil , Zeynep Temel

Learning to Act Through Contact: A Unified View of Multi-Task Robot Learning

We present a unified framework for multi-task locomotion and manipulation policy learning grounded in a contact-explicit representation. Instead of designing different policies for different tasks, our approach unifies the definition of a…

机器人学 · 计算机科学 2026-05-05 Shafeef Omar , Majid Khadiv

NaviMaster: Learning a Unified Policy for GUI and Embodied Navigation Tasks

Recent advances in Graphical User Interface (GUI) and embodied navigation have driven progress, yet these domains have largely evolved in isolation, with disparate datasets and training paradigms. In this paper, we observe that both tasks…

机器人学 · 计算机科学 2026-05-05 Zhihao Luo , Wentao Yan , Jingyu Gong , Min Wang , Zhizhong Zhang , Xuhong Wang , Yuan Xie , Xin Tan

MorphIt: Flexible Spherical Approximation of Robot Morphology for Representation-driven Adaptation

What if a robot could rethink its own morphological representation to better meet the demands of diverse tasks? Most robotic systems today treat their physical form as a fixed constraint rather than an adaptive resource, forcing the same…

机器人学 · 计算机科学 2026-05-05 Nataliya Nechyporenko , Yutong Zhang , Sean Campbell , Alessandro Roncone

Satellite Autonomous Clock Fault Monitoring with Inter-Satellite Ranges Using Euclidean Distance Matrices

To address the need for robust positioning, navigation, and timing services in lunar environments, this paper proposes a novel onboard clock phase jump detection framework for satellite constellations using range measurements obtained from…

机器人学 · 计算机科学 2026-05-05 Keidai Iiyama , Daniel Neamati , Grace Gao

AutoSpatial: Visual-Language Reasoning for Social Robot Navigation through Efficient Spatial Reasoning Learning

We present a novel method, AutoSpatial, an efficient approach with structured spatial grounding to enhance VLMs' spatial reasoning. By combining minimal manual supervision with large-scale Visual Question-Answering (VQA) pairs…

机器人学 · 计算机科学 2026-05-05 Yangzhe Kong , Daeun Song , Jing Liang , Dinesh Manocha , Ziyu Yao , Xuesu Xiao

Large Language Models for Multi-Robot Systems: A Survey

The rapid advancement of Large Language Models (LLMs) has opened new possibilities in Multi-Robot Systems (MRS), enabling enhanced communication, task allocation and planning, and human-robot interaction. Unlike traditional single-robot and…

机器人学 · 计算机科学 2026-05-05 Peihan Li , Zijian An , Shams Abrar , Lifeng Zhou

QuadPiPS: A Perception-informed Footstep Planner for Quadrupeds With Semantic Affordance Prediction

This work proposes QuadPiPS, a perception-informed framework for quadrupedal foothold planning in the perception space. QuadPiPS employs a novel ego-centric local environment representation, known as the legged egocan, that is extended here…

机器人学 · 计算机科学 2026-05-05 Max Asselmeier , Ye Zhao , Patricio A. Vela

Edge Case Detection in Automated Driving: Methods, Challenges and Future Directions

Automated vehicles promise to enhance transportation safety and efficiency. However, ensuring their reliability in real-world conditions remains challenging, particularly due to rare and unexpected situations known as edge cases. While…

机器人学 · 计算机科学 2026-05-05 Saeed Rahmani , Sabine Rieder , Erwin de Gelder , Marcel Sonntag , Jorge Lorente Mallada , Sytze Kalisvaart , Vahid Hashemi , Bart van Arem , Simeon C. Calvert

Paired-CSLiDAR: Height-Stratified Registration for Cross-Source Aerial-Ground LiDAR Pose Refinement

We introduce Paired-CSLiDAR (CSLiDAR), a cross-source aerial-ground LiDAR benchmark for single-scan pose refinement: refining a ground-scan pose within a 50 m-radius aerial crop. The benchmark contains 12,683 ground-aerial pairs across 6…

机器人学 · 计算机科学 2026-05-04 Montana Hoover , Jing Liang , Tianrui Guan , Dinesh Manocha

MSACT: Multistage Spatial Alignment for Stable Low-Latency Fine Manipulation

Real-world fine manipulation, particularly in bimanual manipulation, typically requires low-latency control and stable visual localization, while collecting large-scale data is costly and limited demonstrations may lead to localization…

机器人学 · 计算机科学 2026-05-04 Xianbo Cai , Hideyuki Ichiwara , Masaki Yoshikawa , Tetsuya Ogata

Stereo Multistage Spatial Attention for Real-Time Mobile Manipulation Under Visual Scale Variation and Disturbances

Robots operating in open, unstructured real-world environments must rely on onboard visual perception while autonomously moving across different locations. Continuous changes in onboard camera viewpoints cause significant visual scale…

机器人学 · 计算机科学 2026-05-04 Xianbo Cai , Hideyuki Ichiwara , Hyogo Hiruma , Masaki Yoshikawa , Hiroshi Ito , Tetsuya Ogata

Learning while Deploying: Fleet-Scale Reinforcement Learning for Generalist Robot Policies

Generalist robot policies increasingly benefit from large-scale pretraining, but offline data alone is insufficient for robust real-world deployment. Deployed robots encounter distribution shifts, long-tail failures, task variations, and…

机器人学 · 计算机科学 2026-05-04 Yi Wang , Xinchen Li , Pengwei Xie , Pu Yang , Buqing Nie , Yunuo Cai , Qinglin Zhang , Chendi Qu , Jeffrey Wu , Jianheng Song , Xinlin Ren , Jingshun Huang , Mingjie Pan , Siyuan Feng , Zhi Chen , Jianlan Luo

MiniVLA-Nav v1: A Multi-Scene Simulation Dataset for Language-Conditioned Robot Navigation

We present MiniVLA-Nav v1, a simulation dataset for Language-Conditioned Object Approach (LCOA) navigation: given a short natural-language instruction, an NVIDIA Nova Carter differential-drive robot must navigate to the named object and…

机器人学 · 计算机科学 2026-05-04 Ali Al-Bustami , Jaerock Kwon

PrefMoE: Robust Preference Modeling with Mixture-of-Experts Reward Learning

Preference-based reinforcement learning offers a scalable alternative to manual reward engineering by learning reward structures from comparative feedback. However, large-scale preference datasets, whether collected from crowdsourced…

机器人学 · 计算机科学 2026-05-04 Ziqin Yuan , Ruiqi Wang , Dezhong Zhao , Baijian Yang , Byung-Cheol Min

Embodied Interpretability: Linking Causal Understanding to Generalization in Vision-Language-Action Models

Vision-Language-Action (VLA) policies often fail under distribution shift, suggesting that decisions may depend on spurious visual correlations rather than task-relevant causes. We formulate visual-action attribution as an interventional…

机器人学 · 计算机科学 2026-05-04 Hanxin Zhang , Mingshuo Xu , Abdulqader Dhafer , Shigang Yue , Hongbiao Dong , Zhou Daniel Hao

A Model-based Visual Contact Localization and Force Sensing System for Compliant Robotic Grippers

Grasp force estimation can help prevent robots from damaging delicate objects during manipulation and improve learning-based robotic control. Integrating force sensing into deformable grippers negotiates trade-offs in cost, complexity,…

机器人学 · 计算机科学 2026-05-04 Kaiwen Zuo , Shuyuan Yang , Zonghe Chua

Task-Conditioned Uncertainty Costmaps for Legged Locomotion

Legged robots maintain dynamic feasibility through multicontact interactions with terrain. Learned foothold prediction can provide feasibility-aware costs for motion planning and path selection, but accurately predicting future contacts…

机器人学 · 计算机科学 2026-05-04 Kartikeya Singh , Christo Aluckal , Romeo Orsolino , Karthik Dantu

Lucid-XR: An Extended-Reality Data Engine for Robotic Manipulation

We introduce Lucid-XR, a generative data engine for creating diverse and realistic-looking multi-modal data to train real-world robotic systems. At the core of Lucid-XR is vuer, a web-based physics simulation environment that runs directly…

机器人学 · 计算机科学 2026-05-04 Yajvan Ravan , Adam Rashid , Alan Yu , Kai McClennen , Gio Huh , Kevin Yang , Zhutian Yang , Qinxi Yu , Xiaolong Wang , Phillip Isola , Ge Yang

E$^2$DT: Efficient and Effective Decision Transformer with Experience-Aware Sampling for Robotic Manipulation

In reinforcement learning (RL) for robotic manipulation, the Decision Transformer (DT) has emerged as an effective framework for addressing long-horizon tasks. However, DT's performance depends heavily on the coverage of collected…

机器人学 · 计算机科学 2026-05-04 Kaiyan Zhao , Borong Zhang , Yiming Wang , Xingyu Liu , Xuetao Li , Yuyang Chen , Xiaoguang Niu