机器人学 — Scifaro

ReViP: Mitigating False Completion in Vision-Language-Action Models with Vision-Proprioception Rebalance

Vision-Language-Action (VLA) models have advanced robotic manipulation by combining vision, language, and proprioception to predict actions. However, previous methods fuse proprioceptive signals directly with vision-language features,…

机器人学 · 计算机科学 2026-03-13 Zhuohao Li , Yinghao Li , Jian-Jian Jiang , Lang Zhou , Tianyu Zhang , Jiadong Yin , Mu Lin , Yi-Lin Wei , Wei-Shi Zheng

FSAG: Enhancing Human-to-Dexterous-Hand Finger-Specific Affordance Grounding via Diffusion Models

Dexterous grasp synthesis must jointly satisfy functional intent and physical feasibility, yet existing pipelines often decouple semantic grounding from refinement, yielding unstable or non-functional contacts under object and pose…

机器人学 · 计算机科学 2026-03-13 Yifan Han , Yichuan Peng , Pengfei Yi , Junyan Li , Hanqing Wang , Gaojing Zhang , Qi Peng Liu , Wenzhao Lian

KnowVal: A Knowledge-Augmented and Value-Guided Autonomous Driving System

Visual-language reasoning, driving knowledge, and value alignment are essential for advanced autonomous driving systems. However, existing approaches largely rely on data-driven learning, making it difficult to capture the complex logic…

机器人学 · 计算机科学 2026-03-13 Zhongyu Xia , Wenhao Chen , Yongtao Wang , Ming-Hsuan Yang

POrTAL: Plan-Orchestrated Tree Assembly for Lookahead

When tasking robots in partially observable environments, these robots must efficiently and robustly plan to achieve task goals under uncertainty. Although many probabilistic planning algorithms exist for this purpose, these algorithms can…

机器人学 · 计算机科学 2026-03-13 Evan Conway , David Porfirio , David Chan , Mark Roberts , Laura M. Hiatt

Time as a Control Dimension in Robot Learning

Temporal awareness plays a central role in intelligent behavior by shaping how actions are paced, coordinated, and adapted to changing goals and environments. In contrast, most robot learning algorithms treat time only as a fixed episode…

机器人学 · 计算机科学 2026-03-13 Yinsen Jia , Boyuan Chen

GUIDES: Guidance Using Instructor-Distilled Embeddings for Pre-trained Robot Policy Enhancement

Pre-trained robot policies serve as the foundation of many validated robotic systems, which encapsulate extensive embodied knowledge. However, they often lack the semantic awareness characteristic of foundation models, and replacing them…

机器人学 · 计算机科学 2026-03-13 Minquan Gao , Xinyi Li , Qing Yan , Xiaojian Sun , Xiaopan Zhang , Chien-Ming Huang , Jiachen Li

When Semantics Connect the Swarm: LLM-Driven Fuzzy Control for Cooperative Multi-Robot Underwater Coverage

Underwater multi-robot cooperative coverage remains challenging due to partial observability, limited communication, environmental uncertainty, and the lack of access to global localization. To address these issues, this paper presents a…

机器人学 · 计算机科学 2026-03-13 Jingzehua Xu , Weihang Zhang , Yangyang Li , Hongmiaoyi Zhang , Guanwen Xie , Jiwei Tang , Shuai Zhang , Yi Li

XGrasp: Gripper-Aware Grasp Detection with Multi-Gripper Data Generation

Real-world robotic systems frequently require diverse end-effectors for different tasks, however most existing grasp detection methods are optimized for a single gripper type, demanding retraining or optimization for each novel gripper…

机器人学 · 计算机科学 2026-03-13 Yeonseo Lee , Jungwook Mun , Hyosup Shin , Guebin Hwang , Junhee Nam , Taeyeop Lee , Sungho Jo

UniFField: A Generalizable Unified Neural Feature Field for Visual, Semantic, and Spatial Uncertainties in Any Scene

Comprehensive visual, geometric, and semantic understanding of a 3D scene is crucial for successful execution of robotic tasks, especially in unstructured and complex environments. Additionally, to make robust decisions, it is necessary for…

机器人学 · 计算机科学 2026-03-13 Christian Maurer , Snehal Jauhri , Sophie Lueth , Georgia Chalvatzaki

Efficient Construction of Implicit Surface Models From a Single Image for Motion Generation

Implicit representations have been widely applied in robotics for obstacle avoidance and path planning. In this paper, we explore the problem of constructing an implicit distance representation from a single image. Past methods for implicit…

机器人学 · 计算机科学 2026-03-13 Wei-Teng Chu , Tianyi Zhang , Matthew Johnson-Roberson , Weiming Zhi

Online Slip Detection and Friction Coefficient Estimation for Autonomous Racing

Accurate knowledge of the tire-road friction coefficient (TRFC) is essential for vehicle safety, stability, and performance, especially in autonomous racing, where vehicles often operate at the friction limit. However, TRFC cannot be…

机器人学 · 计算机科学 2026-03-13 Christopher Oeltjen , Carson Sobolewski , Saleh Faghfoorian , Lorant Domokos , Giancarlo Vidal , Sriram Yerramsetty , Ivan Ruchkin

ManiVID-3D: Generalizable View-Invariant Reinforcement Learning for Robotic Manipulation via Disentangled 3D Representations

Deploying visual reinforcement learning (RL) policies in real-world manipulation is often hindered by camera viewpoint changes. A policy trained from a fixed front-facing camera may fail when the camera is shifted -- an unavoidable…

机器人学 · 计算机科学 2026-03-13 Zheng Li , Pei Qu , Yufei Jia , Shihui Zhou , Haizhou Ge , Jiahang Cao , Jinni Zhou , Guyue Zhou , Jun Ma

Whleaper: A 10-DOF Flexible Bipedal Wheeled Robot

Wheel-legged robots combine the advantages of both wheeled robots and legged robots, offering versatile locomotion capabilities with excellent stability on challenging terrains and high efficiency on flat surfaces. However, existing…

机器人学 · 计算机科学 2026-03-13 Yinglei Zhu , Sixiao He , Yan Ning , Zhenghao Qi , Zhuoyuan Yong , Yihua Qin , Jianyu Chen

Zero-shot Sim-to-Real Transfer for Reinforcement Learning-based Visual Servoing of Soft Continuum Arms

Soft continuum arms (SCAs) soft and deformable nature presents challenges in modeling and control due to their infinite degrees of freedom and non-linear behavior. This work introduces a reinforcement learning (RL)-based framework for…

机器人学 · 计算机科学 2026-03-13 Hsin-Jung Yang , Mahsa Khosravi , Benjamin Walt , Girish Krishnan , Soumik Sarkar

Inference-Time Enhancement of Generative Robot Policies via Predictive World Modeling

We present Generative Predictive Control (GPC), an inference-time method for improving pretrained behavior-cloning policies without retraining. GPC augments a frozen diffusion policy at deployment with an action-conditioned world model…

机器人学 · 计算机科学 2026-03-13 Han Qi , Haocheng Yin , Aris Zhu , Yilun Du , Heng Yang

PPGuide: Steering Diffusion Policies with Performance Predictive Guidance

Diffusion policies have shown to be very efficient at learning complex, multi-modal behaviors for robotic manipulation. However, errors in generated action sequences can compound over time which can potentially lead to failure. Some…

机器人学 · 计算机科学 2026-03-12 Zixing Wang , Devesh K. Jha , Ahmed H. Qureshi , Diego Romeres

Learning Adaptive Force Control for Contact-Rich Sample Scraping with Heterogeneous Materials

The increasing demand for accelerated scientific discovery, driven by global challenges, highlights the need for advanced AI-driven robotics. Deploying robotic chemists in human-centric labs is key for the next horizon of autonomous…

机器人学 · 计算机科学 2026-03-12 Cenk Cetin , Shreyas Pouli , Gabriella Pizzuto

Contact Coverage-Guided Exploration for General-Purpose Dexterous Manipulation

Deep Reinforcement learning (DRL) has achieved remarkable success in domains with well-defined reward structures, such as Atari games and locomotion. In contrast, dexterous manipulation lacks general-purpose reward formulations and…

机器人学 · 计算机科学 2026-03-12 Zixuan Liu , Ruoyi Qiao , Chenrui Tie , Xuanwei Liu , Yunfan Lou , Chongkai Gao , Zhixuan Xu , Lin Shao

A gripper for flap separation and opening of sealed bags

Separating thin, flexible layers that must be individually grasped is a common but challenging manipulation primitive for most off-the-shelf grippers. A prominent example arises in clinical settings: the opening of sterile flat pouches for…

机器人学 · 计算机科学 2026-03-12 Sergi Foix , Jaume Oriol , Carme Torras , Júlia Borràs

FG-CLTP: Fine-Grained Contrastive Language Tactile Pretraining for Robotic Manipulation

Recent advancements in integrating tactile sensing into vision-language-action (VLA) models have demonstrated transformative potential for robotic perception. However, existing tactile representations predominantly rely on qualitative…

机器人学 · 计算机科学 2026-03-12 Wenxuan Ma , Chaofan Zhang , Yinghao Cai , Guocai Yao , Shaowei Cui , Shuo Wang