机器人学 — Scifaro

BOWConnect: Parallel Bayesian Optimization over Windows with Learned Local Cost Maps for Sample-Efficient Kinodynamic Motion Planning

This paper presents BOWConnect, a bidirectional parallel kinodynamic motion planner that addresses three fundamental limitations of existing sampling-based methods: sample inefficiency in high-dimensional state spaces, unreliable cost…

机器人学 · 计算机科学 2026-06-25 Sourav Raxit , Abdullah Al Redwan Newaz , Jose Fuentes , Leonardo Bobadilla

E-TTS: A New Embodied Test-Time Scaling Framework for Robotic Manipulation

Recently, a few works have made early attempts to study test-time scaling for embodied tasks. However, two major challenges remain unsolved: (1) reasoning can effectively improve the performance of the policy, but its scaling mechanism has…

机器人学 · 计算机科学 2026-06-25 Wen Ye , Peiyan Li , Tingyu Yuan , Yuan Xu , Xiangnan Wu , Chaoyang Zhao , Jing Liu , Nianfeng Liu , Yan Huang , Liang Wang

Advancing Omnimodal Embodied Agents from Isolated Skills to Everyday Physical Autonomy

Building persistent embodied agents in unstructured environments demands unified orchestration of heterogeneous tools spanning both cyber (APIs, IoT) and physical (manipulation, navigation) domains, coupled with autonomous recovery from…

机器人学 · 计算机科学 2026-06-25 Junhao Shi , Zezheng Huai , Siyin Wang , Jia Chen , Yubang Wang , Zhaoye Fei , Hechang Chen , Jingjing Gong , Xipeng Qiu , Yu-Gang Jiang

HumanoidUMI: Bridging Robot-Free Demonstrations and Humanoid Whole-Body Manipulation

High-quality demonstration data are essential for humanoid robot skill learning, especially for whole-body behaviors that require coordinated perception, locomotion, and manipulation. Existing data-collection methods largely rely on robot…

机器人学 · 计算机科学 2026-06-25 Hongwu Wang , Chenhao Yu , Youhao Hu , Jiachen Zhang , Yuanyuan Li , Shaqi Luo

Learning to Fold: prizewinning solution at LeHome Challenge 2026 (1st place online, 2nd offline)

I describe my solution to the LeHome Challenge 2026, an ICRA 2026 competition on bimanual garment folding. The system placed 1st of 62 teams in the online (simulation) round and 2nd in the real-world final. It improves a…

机器人学 · 计算机科学 2026-06-25 Ilia Larchenko

PhysReflect-VLA: Physical Feasibility and Self-Reflective Regulation for Reliable Vision-Language-Action Policies

Long-horizon robotic manipulation is highly sensitive to physically infeasible transitions, contact-induced disturbances, and the lack of effective self-correction during execution. Although Vision-Language-Action (VLA) models provide…

机器人学 · 计算机科学 2026-06-25 Jiayu Yang , Tao Yang , Weijun Li , Xiang Chang , Fei Chao , Changjing Shang , Qiang Shen

PAMAE: Phase-Aware-MoE Action Experts Towards Reliable Flow-Matching Vision-Language-Action Policies

Reliable action generation for multi-stage robotic manipulation remains challenging for Vision-Language-Action (VLA) models. While existing flow-matching VLA policies offer strong multimodal grounding and generalization, they typically…

机器人学 · 计算机科学 2026-06-25 Jiayu Yang , Tao Yang , Xiang Chang , Fei Chao , Changjing Shang , Qiang Shen

Proposal-Conditioned Latent Diffusion for Closed-Loop Traffic Scenario Generation

Closed-loop traffic simulation remains challenging because it must generate interactive multi-agent behaviors that are scene-consistent and controllable throughout rollout. Prior diffusion-based approaches achieve strong realism, but their…

机器人学 · 计算机科学 2026-06-25 Shubham Vaijanath Phoolari , Aleyna Kara , Christoph Lauer , Steven Peters

ForesightSafety-VLA: A Unified Diagnostic Safety Benchmark for Vision-Language-Action Models

In embodied intelligence, safety is a prerequisite for reliable robot deployment in the physical world. Current vision-language-action (VLA) models continue to advance toward general-purpose task capability, yet their embodied safety limits…

机器人学 · 计算机科学 2026-06-25 Mingyang Lyu , Yinqian Sun , Yiyang Jia , Sicheng Shen , Moquan Sha , Huangrui Li , Feifei Zhao , Yi Zeng

RelAfford6D: Relational 6D Affordance Graphs for Constraint-Driven Robotic Manipulation

Bridging abstract semantics and precise physical control remains a fundamental challenge in open-world robotic manipulation. While recent data-driven policies show promise, their reliance on isolated contact points or latent affordance…

机器人学 · 计算机科学 2026-06-25 Guodong Zhang , Qichen He , Wenyuan Xie , Shaokai Wu , Yanbiao Ji , Qiuchang Li , Bayram Bayramli , Yue Ding , Hongtao Lu

In-Context Model Predictive Generation: Open-Vocabulary Motion Synthesis from Language Models to Physics

Synthesizing human motion from textual descriptions is essential for immersive digital applications, yet existing methods face a persistent trade-off between semantic fidelity and physical realism. Large language model (LLM)-based…

机器人学 · 计算机科学 2026-06-25 Xiaomeng Fu , Junfan Lin , Yang Liu , Yaowei Wang , Guanbin Li , Liang Lin , Ziliang Chen

RobOralScan: Learning Active Intraoral Scanning for Robotic Dental Reconstruction

Intraoral scanning is widely used for digital optical impressions in prosthodontic, implant, and orthodontic treatment, but full-arch and long-span scanning remain labor-intensive tasks with limited automation. In the confined oral cavity,…

机器人学 · 计算机科学 2026-06-25 Jinhyung Lee , Haeun Yun , Siwon Kim , Gihyun Baek , Sungho Moon , Sehyun Hwang , Sunghoon Im

UAV-MapFusion: RTK-Aligned Uncertainty-Aware Coarse-to-Fine Multi-Session UAV Mapping

Large-scale point cloud maps are essential for robotics and spatial intelligence tasks. UAVs provide an efficient means for large-scale map acquisition; however, due to limited flight endurance and onboard storage, mapping a large-scale…

机器人学 · 计算机科学 2026-06-25 Feng Pan , Chunran Zheng , Bing Xue , Yukang Cui , Jiayu Wen , Zhiyu Chen , Wei Wang

Risk-Aware Selective Multimodal Driver Monitoring with Driver-State World Modeling

Continuous driver monitoring in automated vehicles requires low-latency inference while avoiding unsafe decisions under uncertain driver states. Large vision-language models provide broad multimodal priors, but their latency and limited…

机器人学 · 计算机科学 2026-06-25 Daosheng Qiu , Haozhuang Chi , Hao Su , Shu Long , Xinyue Miao , Yongle Dong , Wei Zhang

PlanRL: A Trajectory Planning Architecture for Reinforcement Learning-based Driving Experts

Reinforcement learning (RL) has become a prominent framework for developing driving experts in autonomous vehicles. However, most existing RL-based experts are designed to output direct control commands (e.g., throttle, steering), which…

机器人学 · 计算机科学 2026-06-25 Joonhee Lim , Yongjae Lee , Jangho Shin , Dongsuk Kum

Humanoid-DART: Humanoid Loco-Manipulation using Diffusion-guided Augmentation through Relabeling and Tracking

Imitating human demonstrations has emerged as a dominant paradigm for learning humanoid loco-manipulation policies. However, scaling these approaches remains challenging due to the high cost of collecting diverse demonstrations and the need…

机器人学 · 计算机科学 2026-06-25 Pranav Debbad , Kanish Thiagarajan , Victor Dhédin , Shafeef Omar , Majid Khadiv

Ordinal Neural Collapse as a Representation Prior for Visual Navigation

Learning robust navigation policies directly from visual observations remains a fundamental challenge in vision-based robotic navigation. In end-to-end imitation learning approaches, the visual encoder and action decoder are jointly…

机器人学 · 计算机科学 2026-06-25 E-In Son , Jung-Taak Kim , Seung-Woo Seo

Improving Vision-Language-Action Model Fine-Tuning with Structured Stage and Keyframe Supervision

Vision-Language-Action (VLA) models have shown strong potential for generalizable robotic manipulation. During fine-tuning, however, action supervision applies equally across all timesteps, without structured supervision on which…

机器人学 · 计算机科学 2026-06-25 Yuan Xu , Yixiang Chen , Kai Wang , Jiabing Yang , Peiyan Li , Qisen Ma , Yan Huang , Liang Wang

SSI-Policy: Learning Structured Scene Interfaces for Vision-Language Robotic Manipulation

Real-world robotic manipulation demands spatial grounding, task-aware reasoning, and precise control. Learning such capabilities becomes particularly challenging in the low-data regime. Prior methods often trade off scalable task-level…

机器人学 · 计算机科学 2026-06-25 Kaijun Wang , Zikai Ouyang , Xuping Wu , Jinyi Hong , Wei Pan , Haibo Lu , Jia Pan , Wei Zhang , Linfang Zheng

PressMimic: Pressure-Guided Motion Capture and Control for Humanoid Robot Imitation

Humanoid motion imitation requires not only accurate perception of human kinematics but also faithful reproduction of physical interactions with the environment. However, existing pipelines rely primarily on vision-based motion capture and…

机器人学 · 计算机科学 2026-06-25 Yi Lu , Shenghao Ren , Tianyu Xiong , Zhaoxiang Li , Jiaqi Li , He Zhang , Tao Yu , Qiu Shen , Xun Cao