Robotics - Scifaro

LiPS: Lightweight Panoptic Segmentation for Resource-Constrained Robotics

Panoptic segmentation is a key enabler for robotic perception, as it unifies semantic understanding with object-level reasoning. However, the increasing complexity of state-of-the-art models makes them unsuitable for deployment on…

Robotics · Computer Science 2026-05-19 Calvin Galagain, Martyna Poreba, François Goulette, Cyrill Stachniss

SutureFormer: Learning Surgical Trajectories via Goal-conditioned Offline RL in Pixel Space

Predicting surgical needle trajectories from endoscopic video is critical for robot-assisted suturing, enabling anticipatory planning, real-time guidance, and safer motion execution. Existing methods that directly learn motion distributions…

Robotics · Computer Science 2026-05-19 Huanrong Liu, Chunlin Tian, Tongyu Jia, Tailai Zhou, Qin Liu, Yu Gao, Yutong Ban, Yun Gu, Guy Rosman, Xin Ma, Qingbiao Li

Bio-Inspired Event-Based Visual Servoing for Ground Robots

Biological sensory systems are inherently adaptive, filtering out constant stimuli and prioritizing relative changes, likely enhancing computational and metabolic efficiency. Inspired by active sensing behaviors across a wide range of…

Robotics · Computer Science 2026-05-19 Maral Mordad, Kian Behzad, Debojyoti Biswas, Noah J. Cowan, Milad Siami

FASTER: Rethinking Real-Time Flow VLAs

Real-time execution is crucial for deploying Vision-Language-Action (VLA) models in the physical world. Existing asynchronous inference methods primarily optimize trajectory smoothness, but neglect the critical latency in reacting to…

Robotics · Computer Science 2026-05-19 Yuxiang Lu, Zhe Liu, Xianzhe Fan, Zhenya Yang, Jinghua Hou, Junyi Li, Kaixin Ding, Hengshuang Zhao

Multi-Source Human-in-the-Loop Digital Twin Testbed for Connected and Autonomous Vehicles in Mixed Traffic Flow

In the emerging mixed traffic environments, Connected and Autonomous Vehicles (CAVs) have to interact with surrounding human-driven vehicles (HDVs). This paper introduces MSH-MCCT (Multi-Source Human-in-the-Loop Mixed Cloud Control…

Robotics · Computer Science 2026-05-19 Jianghong Dong, Chunying Yang, Mengchi Cai, Chaoyi Chen, Qing Xu, Jianqiang Wang, Jiawei Wang, Keqiang Li

OxyGen: Unified KV Cache Management for VLA Inference under Multi-Task Parallelism

Embodied AI agents increasingly require parallel execution of multiple tasks, such as manipulation, conversation, and memory construction, from shared observations under distinct time constraints. Recent Mixture-of-Transformers (MoT)…

Robotics · Computer Science 2026-05-19 Xiangyu Li, Huaizhi Tang, Xin Ding, Weijun Wang, Ting Cao, Yunxin Liu

HandelBot: Real-World Piano Playing via Fast Adaptation of Dexterous Robot Policies

Mastering dexterous manipulation with multi-fingered hands has been a grand challenge in robotics for decades. Despite its potential, the difficulty of collecting high-quality data remains a primary bottleneck for high-precision tasks.…

Robotics · Computer Science 2026-05-19 Amber Xie, Haozhi Qi, Dorsa Sadigh

Efficient Trajectory Optimization for Autonomous Racing via Formula-1 Data-Driven Initialization

Trajectory optimization is a central component of fast and efficient autonomous racing. However, practical optimization pipelines remain highly sensitive to initialization and may converge slowly or to suboptimal local solutions when seeded…

Robotics · Computer Science 2026-05-19 Samir Shehadeh, Lukas Kutsch, Nils Dengler, Sicong Pan, Maren Bennewitz

cuNRTO: GPU-Accelerated Nonlinear Robust Trajectory Optimization

Robust trajectory optimization enables autonomous systems to operate safely under uncertainty by computing control policies that satisfy the constraints for all bounded disturbances. However, these problems often lead to large Second Order…

Robotics · Computer Science 2026-05-19 Jiawei Wang, Arshiya Taj Abdul, Evangelos A. Theodorou

Unleashing the Potential of Diffusion Models for End-to-End Autonomous Driving

Diffusion models have become a popular choice for decision-making tasks in robotics, and more recently, are also being considered for solving autonomous driving tasks. However, their applications and evaluations in autonomous driving remain…

Robotics · Computer Science 2026-05-19 Yinan Zheng, Tianyi Tan, Bin Huang, Enguang Liu, Ruiming Liang, Jianlin Zhang, Jianwei Cui, Guang Chen, Kun Ma, Hangjun Ye, Long Chen, Ya-Qin Zhang, Xianyuan Zhan, Jingjing Liu

Global Prior Meets Local Consistency: Dual-Memory Augmented Vision-Language-Action Model for Efficient Robotic Manipulation

Hierarchical Vision-Language-Action (VLA) models have rapidly become a dominant paradigm for robotic manipulation. They typically comprise a Vision-Language backbone for perception and understanding, together with a generative policy for…

Robotics · Computer Science 2026-05-19 Zaijing Li, Bing Hu, Rui Shao, Gongwei Chen, Dongmei Jiang, Pengwei Xie, Jianye Hao, Liqiang Nie

One Hand to Rule Them All: Canonical Representations for Unified Dexterous Manipulation

Dexterous manipulation policies today largely assume fixed hand designs, severely restricting their generalization to new embodiments with varied kinematic and structural layouts. To overcome this limitation, we introduce a parameterized…

Robotics · Computer Science 2026-05-19 Zhenyu Wei, Yunchao Yao, Mingyu Ding

Learning Native Continuation for Action Chunking Flow Policies

Action chunking enables Vision Language Action (VLA) models to run in real time, but naive chunked execution often exhibits discontinuities at chunk boundaries. Real-Time Chunking (RTC) alleviates this issue but is external to the policy,…

Robotics · Computer Science 2026-05-19 Yufeng Liu, Hang Yu, Juntu Zhao, Bocheng Li, Di Zhang, Mingzhu Li, Wenxuan Wu, Yingdong Hu, Junyuan Xie, Junliang Guo, Dequan Wang, Yang Gao

Real-to-Sim for Highly Cluttered Environments via Physics-Consistent Inter-Object Reasoning

Reconstructing physically valid 3D scenes from single-view observations is a prerequisite for bridging the gap between visual perception and robotic control. However, in scenarios requiring precise contact reasoning, such as robotic…

Robotics · Computer Science 2026-05-19 Tianyi Xiang, Jiahang Cao, Sikai Guo, Guoyang Zhao, Andrew F. Luo, Jun Ma

Towards Long-Lived Robots: Continual Learning VLA Models via Reinforcement Fine-Tuning

Pretrained on large-scale and diverse datasets, VLA models demonstrate strong generalization and adaptability as general-purpose robotic policies. However, Supervised Fine-Tuning (SFT), which serves as the primary mechanism for adapting…

Robotics · Computer Science 2026-05-19 Yuan Liu, Haoran Li, Shuai Tian, Yuxing Qin, Yuhui Chen, Yupeng Zheng, Yongzhen Huang, Dongbin Zhao

Self-Supervised Bootstrapping of Action-Predictive Embodied Reasoning

Embodied Chain-of-Thought (CoT) reasoning has significantly enhanced Vision-Language-Action (VLA) models, yet current methods rely on rigid templates to specify reasoning primitives (e.g., objects in the scene, high-level plans, structural…

Robotics · Computer Science 2026-05-19 Milan Ganai, Katie Luo, Jonas Frey, Clark Barrett, Marco Pavone

SuReNav: Superpixel Graph-based Constraint Relaxation for Navigation in Over-constrained Environments

We address the over-constrained planning problem in semi-static environments. The planning objective is to find a best-effort solution that avoids all hard constraint regions while minimally traversing the least risky areas. Conventional…

Robotics · Computer Science 2026-05-19 Keonyoung Koh, Moonkyeong Jung, Samuel Seungsup Lee, Daehyung Park

PLATO Hand: Shaping Contact Behavior with Fingernails for Precise Manipulation

We present the PLATO Hand, a dexterous robotic hand with a hybrid fingertip that combines a rigid fingernail, embedded distal phalanx, and compliant pulp to shape contact behavior during manipulation. By mechanically organizing how…

Robotics · Computer Science 2026-05-19 Dong Ho Kang, Aaron Kim, Mingyo Seo, Kazuto Yokoyama, Tetsuya Narita, Luis Sentis

Adaptive Control in Autonomous Driving via Real-Time Recurrent RL

We study online fine-tuning of pretrained control policies for autonomous driving using Real-Time Recurrent Reinforcement Learning (RTRRL), a memory-efficient algorithm that updates policy parameters at every time step without…

Robotics · Computer Science 2026-05-19 Julian Lemmel, Felix Resch, Mónika Farsang, Ramin Hasani, Daniela Rus, Radu Grosu

CoLA-Flow Policy: Temporally Coherent Imitation Learning via Continuous Latent Action Flow Matching for Robotic Manipulation

Learning long-horizon robotic manipulation requires jointly achieving expressive behavior modeling, real-time inference, and stable execution, which remains challenging for existing generative policies. Diffusion-based approaches offer…

Robotics · Computer Science 2026-05-19 Wu Songwei, Jiang Zhiduo, Sun Wandong, Xie Guanghu, Zhao Rui, Liu Hong, Liu Yang