机器人学 — Scifaro

Bridging the Awareness Gap: Socially Mediated State Externalization for Transparent Distributed Home Robots

Distributed multi-robot systems for the home often require robots to operate out of the user's sight, creating a state awareness gap that can diminish trust and perceived transparency and control. This paper investigates whether real-time,…

机器人学 · 计算机科学 2026-03-31 Wenzheng Zhao , Manideep Duggi , Fengpei Yuan

Contextual Graph Representations for Task-Driven 3D Perception and Planning

Recent advances in computer vision facilitate fully automatic extraction of object-centric relational representations from visual-inertial data. These state representations, dubbed 3D scene graphs, are a hierarchical decomposition of…

机器人学 · 计算机科学 2026-03-31 Christopher Agia

Co-designing a Social Robot for Newcomer Children's Cultural and Language Learning

Newcomer children face barriers in acquiring the host country's language and literacy programs are often constrained by limited staffing, mixed-proficiency cohorts, and short contact time. While Socially Assistive Robots (SARs) show promise…

机器人学 · 计算机科学 2026-03-31 Neil Fernandes , Tehniyat Shahbaz , Emily Davies-Robinson , Yue Hu , Kerstin Dautenhahn

Ruka-v2: Tendon Driven Open-Source Dexterous Hand with Wrist and Abduction for Robot Learning

Lack of accessible and dexterous robot hardware has been a significant bottleneck to achieving human-level dexterity in robots. Last year, we released Ruka, a fully open-sourced, tendon-driven humanoid hand with 11 degrees of freedom - 2…

机器人学 · 计算机科学 2026-03-31 Xinqi Lucas Liu , Ruoxi Hu , Alejandro Ojeda Olarte , Zhuoran Chen , Kenny Ma , Charles Cheng Ji , Lerrel Pinto , Raunaq Bhirangi , Irmak Guzey

The Multi-AMR Buffer Storage, Retrieval, and Reshuffling Problem: Exact and Heuristic Approaches

Buffer zones are essential in production systems to decouple sequential processes. In dense floor storage environments, such as space-constrained brownfield facilities, manual operation is increasingly challenged by severe labor shortages…

机器人学 · 计算机科学 2026-03-31 Max Disselnmeyer , Thomas Bömer , Laura Dörr , Bastian Amberg , Anne Meyer

DecompGrind: A Decomposition Framework for Robotic Grinding via Cutting-Surface Planning and Contact-Force Adaptation

Robotic grinding is widely used for shaping workpieces in manufacturing, but it remains difficult to automate this process efficiently. In particular, efficiently grinding workpieces of different shapes and material hardness is challenging…

机器人学 · 计算机科学 2026-03-31 Shunsuke Araki , Takumi Hachimine , Yuki Saito , Kouhei Ohnishi , Jun Morimoto , Takamitsu Matsubara

Scaling Sim-to-Real Reinforcement Learning for Robot VLAs with Generative 3D Worlds

The strong performance of large vision-language models (VLMs) trained with reinforcement learning (RL) has motivated similar approaches for fine-tuning vision-language-action (VLA) models in robotics. Many recent works fine-tune VLAs…

机器人学 · 计算机科学 2026-03-31 Andrew Choi , Xinjie Wang , Zhizhong Su , Wei Xu

Onboard MuJoCo-based Model Predictive Control for Shipboard Crane with Double-Pendulum Sway Suppression

Transferring heavy payloads in maritime settings relies on efficient crane operation, limited by hazardous double-pendulum payload sway. This sway motion is further exacerbated in offshore environments by external perturbations from wind…

机器人学 · 计算机科学 2026-03-31 Oscar Pang , Lisa Coiffard , Paul Templier , Luke Beddow , Kamil Dreczkowski , Antoine Cully

R3DP: Real-Time 3D-Aware Policy for Embodied Manipulation

Embodied manipulation requires accurate 3D understanding of objects and their spatial relations to plan and execute contact-rich actions. While large-scale 3D vision models provide strong priors, their computational cost incurs prohibitive…

机器人学 · 计算机科学 2026-03-31 Yuhao Zhang , Wanxi Dong , Yue Shi , Yi Liang , Jingnan Gao , Qiaochu Yang , Yaxing Lyu , Zhixuan Liang , Yibin Liu , Congsheng Xu , Xianda Guo , Wei Sui , Yaohui Jin , Xiaokang Yang , Yanyan Xu , Yao Mu

AffordGrasp: Cross-Modal Diffusion for Affordance-Aware Grasp Synthesis

Generating human grasping poses that accurately reflect both object geometry and user-specified interaction semantics is essential for natural hand-object interactions in AR/VR and embodied AI. However, existing semantic grasping approaches…

机器人学 · 计算机科学 2026-03-31 Xiaofei Wu , Yi Zhang , Yumeng Liu , Yuexin Ma , Yujiao Shi , Xuming He

AIM-SLAM: Dense Monocular SLAM via Adaptive and Informative Multi-View Keyframe Prioritization with Foundation Model

Recent advances in geometric foundation models have emerged as a promising alternative for addressing the challenge of dense reconstruction in monocular visual simultaneous localization and mapping (SLAM). Although geometric foundation…

机器人学 · 计算机科学 2026-03-31 Jinwoo Jeon , Dong-Uk Seo , Eungchang Mason Lee , Hyun Myung

Grip as Needed, Glide on Demand: Ultrasonic Lubrication for Robotic Locomotion

Friction is the essential mediator of terrestrial locomotion, yet in robotic systems it is almost always treated as a passive property fixed by surface materials and conditions. Here, we introduce ultrasonic lubrication as a method to…

机器人学 · 计算机科学 2026-03-31 Mostafa A. Atalla , Daan van Bemmel , Jack Cummings , Paul Breedveld , Michaël Wiertlewski , Aimée Sakes

ExtremControl: Low-Latency Humanoid Teleoperation with Direct Extremity Control

Building a low-latency humanoid teleoperation system is essential for collecting diverse reactive and dynamic demonstrations. However, existing approaches rely on heavily pre-processed human-to-humanoid motion retargeting and position-only…

机器人学 · 计算机科学 2026-03-31 Ziyan Xiong , Lixing Fang , Junyun Huang , Kashu Yamazaki , Hao Zhang , Chuang Gan

Mimic Intent, Not Just Trajectories

While imitation learning (IL) has achieved impressive success in dexterous manipulation through generative modeling and pretraining, state-of-the-art approaches like Vision-Language-Action (VLA) models still struggle with adaptation to…

机器人学 · 计算机科学 2026-03-31 Renming Huang , Chendong Zeng , Wenjing Tang , Jintian Cai , Cewu Lu , Panpan Cai

ACoT-VLA: Action Chain-of-Thought for Vision-Language-Action Models

Vision-Language-Action models have emerged as essential generalist robot policies for diverse manipulation tasks, conventionally relying on directly translating multimodal inputs into actions via Vision-Language Model embeddings. Recent…

机器人学 · 计算机科学 2026-03-31 Linqing Zhong , Yi Liu , Yifei Wei , Ziyu Xiong , Maoqing Yao , Si Liu , Guanghui Ren

LaST$_{0}$: Latent Spatio-Temporal Chain-of-Thought for Robotic Vision-Language-Action Model

Vision-Language-Action (VLA) models have recently shown strong generalization, with some approaches seeking to explicitly generate linguistic reasoning traces or predict future observations prior to execution. However, explicit reasoning…

机器人学 · 计算机科学 2026-03-31 Zhuoyang Liu , Jiaming Liu , Hao Chen , Jiale Yu , Ziyu Guo , Chengkai Hou , Chenyang Gu , Xiangju Mi , Renrui Zhang , Kun Wu , Zhengping Che , Jian Tang , Pheng-Ann Heng , Shanghang Zhang

CycleManip: Enabling Cyclic Task Manipulation via Effective Historical Perception and Understanding

In this paper, we explore an important yet underexplored task in robot manipulation: cycle-based manipulation, where robots need to perform cyclic or repetitive actions with an expected terminal time. These tasks are crucial in daily life,…

机器人学 · 计算机科学 2026-03-31 Yi-Lin Wei , Haoran Liao , Yuhao Lin , Pengyue Wang , Zhizhao Liang , Guiliang Liu , Wei-Shi Zheng

FlexiCup: Wireless Multimodal Suction Cup with Dual-Zone Vision-Tactile Sensing

Conventional suction cups lack sensing capabilities for contact-aware manipulation in unstructured environments. This paper presents FlexiCup, a multimodal suction cup with wireless electronics that integrate dual-zone vision-tactile…

机器人学 · 计算机科学 2026-03-31 Junhao Gong , Shoujie Li , Kit-Wa Sou , Changqing Guo , Hourong Huang , Tong Wu , Yifan Xie , Chenxin Liang , Chuqiao Lyu , Xiaojun Liang , Wenbo Ding

ViPRA: Video Prediction for Robot Actions

Can we turn a video prediction model into a robot policy? Videos, including those of humans or teleoperated robots, capture rich physical interactions. However, most of them lack labeled actions, which limits their use in robot learning. We…

机器人学 · 计算机科学 2026-03-31 Sandeep Routray , Hengkai Pan , Unnat Jain , Shikhar Bahl , Deepak Pathak

DIV-Nav: Open-Vocabulary Spatial Relationships for Multi-Object Navigation

Advances in open-vocabulary semantic mapping and object navigation have enabled robots to perform an informed search of their environment for an arbitrary object. However, such zero-shot object navigation is typically designed for simple…

机器人学 · 计算机科学 2026-03-31 Jesús Ortega-Peimbert , Finn Lukas Busch , Timon Homberger , Quantao Yang , Olov Andersson