Robotics — Scifaro

Drifting in the Future: Stabilizing Path Following Drifting on High-Latency Vehicle Systems

Autonomously controlling and handling a vehicle at and beyond its stability limit is a mathematically and computationally demanding task. Prior demonstrations of automated drifting have been limited to research platforms with instantaneous…

Robotics · Computer Science 2026-06-26 Frederik Werner, Till Heintzenberg, Markus Lienkamp, Johannes Betz

Scalable Behavior Cloning with Open Data, Training, and Evaluation

We introduce ABC, a fully open-source stack for manipulation with behavior cloning. At its core is ABC-130K: the largest open-source teleoperation dataset to date, featuring 3,500 hours of data spanning over 130K episodes across 195 diverse…

Robotics · Computer Science 2026-06-25 Arthur Allshire, Himanshu Gaurav Singh, Ritvik Singh, Adam Rashid, Hongsuk Choi, David McAllister, Justin Yu, Yiyuan Chen, Huang Huang, Pieter Abbeel, Xi Chen, Rocky Duan, Phillip Isola, Jitendra Malik, Fred Shentu, Guanya Shi, Philipp Wu, Angjoo Kanazawa

World Action Models Enable Continual Imitation Learning with Recurrent Generative Replays

Going beyond predicting robot actions, World Action Models (WAMs) can also generate future visual observations. We build on this generative capability to propose Recurrent Generative Replay (REGEN), a continual imitation learning framework…

Robotics · Computer Science 2026-06-25 Manish Kumar Govind, Dominick Reilly, Smit Patel, Hieu Le, Srijan Das

RouterVLA: Turning Smoke Tests into Supervision for Heterogeneous VLA Selection

We study whether pre-deployment evaluation rollouts can be reused to supervise policy selection. Robot teams routinely smoke test candidate vision-language-action (VLA) policies, then compress those trials into a global winner. RouterVLA…

Robotics · Computer Science 2026-06-25 Xingyu Ren, Chugang Yi, Ge Ma, Youran Sun

Continual Robot Policy Learning via Variational Neural Dynamics

Robots deployed in the real world rarely operate under a single fixed dynamics model: wind changes, payloads vary, batteries drain, contacts shift, and hardware wears. Yet most learning-based controllers are trained once and deployed as if…

Robotics · Computer Science 2026-06-25 Jiaxu Xing, Zhiyuan Zhu, Yunfan Ren, Ismail Geles, Yifan Zhai, Rudolf Reiter, Davide Scaramuzza

Bridging Performance and Generalization in Reinforcement Learning for Agile Flight

Autonomous drone racing is a fundamentally challenging regime for autonomous aerial robots, requiring time-optimal control while operating under persistent actuation saturation. While reinforcement learning (RL) has achieved human-level…

Robotics · Computer Science 2026-06-25 Jonathan Green, Jiaxu Xing, Nico Messikommer, Angel Romero, Davide Scaramuzza

VibeAct: Vibration to Actions for Contact-Rich Reactive Robot Dexterity

Dexterous manipulation depends on contact events that are fast, local, and often visually occluded. Piezoelectric microphones offer a compact and high-bandwidth way to sense these interactions, but the resulting vibro-acoustic signals are…

Robotics · Computer Science 2026-06-25 Yuemin Mao, Uksang Yoo, Jean Oh, Jonathan Francis, Jeffrey Ichnowski

LA4VLA: Learning to Act without Seeing via Language-Action Pretraining

Vision-Language-Action (VLA) models are commonly pretrained on robot demonstrations by jointly mapping visual observations and language instructions to actions. However, dense visual-action supervision can dominate the comparatively sparse…

Robotics · Computer Science 2026-06-25 Tao Lin, Yuxin Du, Yiran Mao, Zewei Ye, Yilei Zhong, Bing Cheng, Yiming Wang, Jiting Liu, Yang Tian, Junchi Yan, Feiran Wu, Zenan Meng, Hu Wei, Yuqian Fu, Gen Li, Bo Zhao

BOWConnect: Parallel Bayesian Optimization over Windows with Learned Local Cost Maps for Sample-Efficient Kinodynamic Motion Planning

This paper presents BOWConnect, a bidirectional parallel kinodynamic motion planner that addresses three fundamental limitations of existing sampling-based methods: sample inefficiency in high-dimensional state spaces, unreliable cost…

Robotics · Computer Science 2026-06-25 Sourav Raxit, Abdullah Al Redwan Newaz, Jose Fuentes, Leonardo Bobadilla

E-TTS: A New Embodied Test-Time Scaling Framework for Robotic Manipulation

Recently, a few works have made early attempts to study test-time scaling for embodied tasks. However, two major challenges remain unsolved: (1) reasoning can effectively improve the performance of the policy, but its scaling mechanism has…

Robotics · Computer Science 2026-06-25 Wen Ye, Peiyan Li, Tingyu Yuan, Yuan Xu, Xiangnan Wu, Chaoyang Zhao, Jing Liu, Nianfeng Liu, Yan Huang, Liang Wang

Advancing Omnimodal Embodied Agents from Isolated Skills to Everyday Physical Autonomy

Building persistent embodied agents in unstructured environments demands unified orchestration of heterogeneous tools spanning both cyber (APIs, IoT) and physical (manipulation, navigation) domains, coupled with autonomous recovery from…

Robotics · Computer Science 2026-06-25 Junhao Shi, Zezheng Huai, Siyin Wang, Jia Chen, Yubang Wang, Zhaoye Fei, Hechang Chen, Jingjing Gong, Xipeng Qiu, Yu-Gang Jiang

HumanoidUMI: Bridging Robot-Free Demonstrations and Humanoid Whole-Body Manipulation

High-quality demonstration data are essential for humanoid robot skill learning, especially for whole-body behaviors that require coordinated perception, locomotion, and manipulation. Existing data-collection methods largely rely on robot…

Robotics · Computer Science 2026-06-25 Hongwu Wang, Chenhao Yu, Youhao Hu, Jiachen Zhang, Yuanyuan Li, Shaqi Luo

Learning to Fold: prizewinning solution at LeHome Challenge 2026 (1st place online, 2nd offline)

I describe my solution to the LeHome Challenge 2026, an ICRA 2026 competition on bimanual garment folding. The system placed 1st of 62 teams in the online (simulation) round and 2nd in the real-world final. It improves a…

Robotics · Computer Science 2026-06-25 Ilia Larchenko

PhysReflect-VLA: Physical Feasibility and Self-Reflective Regulation for Reliable Vision-Language-Action Policies

Long-horizon robotic manipulation is highly sensitive to physically infeasible transitions, contact-induced disturbances, and the lack of effective self-correction during execution. Although Vision-Language-Action (VLA) models provide…

Robotics · Computer Science 2026-06-25 Jiayu Yang, Tao Yang, Weijun Li, Xiang Chang, Fei Chao, Changjing Shang, Qiang Shen

PAMAE: Phase-Aware-MoE Action Experts Towards Reliable Flow-Matching Vision-Language-Action Policies

Reliable action generation for multi-stage robotic manipulation remains challenging for Vision-Language-Action (VLA) models. While existing flow-matching VLA policies offer strong multimodal grounding and generalization, they typically…

Robotics · Computer Science 2026-06-25 Jiayu Yang, Tao Yang, Xiang Chang, Fei Chao, Changjing Shang, Qiang Shen

Proposal-Conditioned Latent Diffusion for Closed-Loop Traffic Scenario Generation

Closed-loop traffic simulation remains challenging because it must generate interactive multi-agent behaviors that are scene-consistent and controllable throughout rollout. Prior diffusion-based approaches achieve strong realism, but their…

Robotics · Computer Science 2026-06-25 Shubham Vaijanath Phoolari, Aleyna Kara, Christoph Lauer, Steven Peters

ForesightSafety-VLA: A Unified Diagnostic Safety Benchmark for Vision-Language-Action Models

In embodied intelligence, safety is a prerequisite for reliable robot deployment in the physical world. Current vision-language-action (VLA) models continue to advance toward general-purpose task capability, yet their embodied safety limits…

Robotics · Computer Science 2026-06-25 Mingyang Lyu, Yinqian Sun, Yiyang Jia, Sicheng Shen, Moquan Sha, Huangrui Li, Feifei Zhao, Yi Zeng

RelAfford6D: Relational 6D Affordance Graphs for Constraint-Driven Robotic Manipulation

Bridging abstract semantics and precise physical control remains a fundamental challenge in open-world robotic manipulation. While recent data-driven policies show promise, their reliance on isolated contact points or latent affordance…

Robotics · Computer Science 2026-06-25 Guodong Zhang, Qichen He, Wenyuan Xie, Shaokai Wu, Yanbiao Ji, Qiuchang Li, Bayram Bayramli, Yue Ding, Hongtao Lu

In-Context Model Predictive Generation: Open-Vocabulary Motion Synthesis from Language Models to Physics

Synthesizing human motion from textual descriptions is essential for immersive digital applications, yet existing methods face a persistent trade-off between semantic fidelity and physical realism. Large language model (LLM)-based…

Robotics · Computer Science 2026-06-25 Xiaomeng Fu, Junfan Lin, Yang Liu, Yaowei Wang, Guanbin Li, Liang Lin, Ziliang Chen

RobOralScan: Learning Active Intraoral Scanning for Robotic Dental Reconstruction

Intraoral scanning is widely used for digital optical impressions in prosthodontic, implant, and orthodontic treatment, but full-arch and long-span scanning remain labor-intensive tasks with limited automation. In the confined oral cavity,…

Robotics · Computer Science 2026-06-25 Jinhyung Lee, Haeun Yun, Siwon Kim, Gihyun Baek, Sungho Moon, Sehyun Hwang, Sunghoon Im