机器人学 — Scifaro

EgoDemoGen: Egocentric Demonstration Generation for Viewpoint Generalization in Robotic Manipulation

Imitation learning based visuomotor policies have achieved strong performance in robotic manipulation, yet they often remain sensitive to egocentric viewpoint shifts. Unlike third-person viewpoint changes that only move the camera,…

机器人学 · 计算机科学 2026-03-31 Yuan Xu , Jiabing Yang , Xiaofeng Wang , Yixiang Chen , Zheng Zhu , Bowen Fang , Guan Huang , Xinze Chen , Yun Ye , Qiang Zhang , Peiyan Li , Xiangnan Wu , Kai Wang , Bing Zhan , Shuo Lu , Jing Liu , Nianfeng Liu , Yan Huang , Liang Wang

Omni-LIVO: Robust RGB-Colored Multi-Camera Visual-Inertial-LiDAR Odometry via Photometric Migration and ESIKF Fusion

Wide field-of-view (FoV) LiDAR sensors provide dense geometry across large environments, but existing LiDAR-inertial-visual odometry (LIVO) systems generally rely on a single camera, limiting their ability to fully exploit LiDAR-derived…

机器人学 · 计算机科学 2026-03-31 Yinong Cao , Chenyang Zhang , Xin He , Yuwei Chen , Chengyu Pu , Bingtao Wang , Kaile Wu , Shouzheng Zhu , Fei Han , Shijie Liu , Chunlai Li , Jianyu Wang

Object-Reconstruction-Aware Whole-body Control of Mobile Manipulators

Object reconstruction and inspection tasks play a crucial role in various robotics applications. Identifying paths that reveal the most unknown areas of the object is paramount in this context, as it directly affects reconstruction…

机器人学 · 计算机科学 2026-03-31 Fatih Dursun , Bruno Vilhena Adorno , Simon Watson , Wei Pan

OVSegDT: Segmenting Transformer for Open-Vocabulary Object Goal Navigation

Open-vocabulary Object Goal Navigation requires an embodied agent to reach objects described by free-form language, including categories never seen during training. Existing end-to-end policies overfit small simulator datasets, achieving…

机器人学 · 计算机科学 2026-03-31 Tatiana Zemskova , Aleksei Staroverov , Dmitry Yudin , Aleksandr Panov

Service Discovery-Based Hybrid Network Middleware for Efficient Communication in Distributed Robotic Systems

Robotic middleware is fundamental to ensuring reliable communication among system components and is crucial for intelligent robotics, autonomous vehicles, and smart manufacturing. However, existing robotic middleware often struggles to meet…

机器人学 · 计算机科学 2026-03-31 Shiyao Sang , Yinggang Ling

Goal-VLA: Image-Generative VLMs as Object-Centric World Models Empowering Zero-shot Robot Manipulation

Generalization remains a fundamental challenge in robotic manipulation. To tackle this challenge, recent Vision-Language-Action (VLA) models build policies on top of Vision-Language Models (VLMs), seeking to transfer their open-world…

机器人学 · 计算机科学 2026-03-31 Haonan Chen , Jingxiang Guo , Bangjun Wang , Tianrui Zhang , Xuchuan Huang , Boren Zheng , Yiwen Hou , Chenrui Tie , Jiajun Deng , Lin Shao

Integrating Maneuverable Planning and Adaptive Control for Robot Cart-Pushing under Disturbances

Precise and flexible cart-pushing is a challenging task for mobile robots. The motion constraints during cart-pushing and the robot's redundancy lead to complex motion planning problems, while variable payloads and disturbances present…

机器人学 · 计算机科学 2026-03-31 Zhe Zhang , Peijia Xie , Yuhan Pang , Zhirui Sun , Bingyi Xia , Bi-Ke Zhu , Jiankun Wang

VLM-SAFE: Vision-Language Model-Guided Safety-Aware Reinforcement Learning with World Models for Autonomous Driving

Autonomous driving policy learning with reinforcement learning (RL) is fundamentally limited by low sample efficiency, weak generalization, and a dependence on unsafe online trial-and-error interactions. Although safe RL introduces explicit…

机器人学 · 计算机科学 2026-03-31 Yansong Qu , Zilin Huang , Zihao Sheng , Jiancong Chen , Yue Leng , Samuel Labi , Sikai Chen

3D CAVLA: Leveraging Depth and 3D Context to Generalize Vision Language Action Models for Unseen Tasks

Robotic manipulation in 3D requires effective computation of N degree-of-freedom joint-space trajectories that enable precise and robust control. To achieve this, robots must integrate semantic understanding with visual perception to…

机器人学 · 计算机科学 2026-03-31 Vineet Bhat , Yu-Hsiang Lan , Prashanth Krishnamurthy , Ramesh Karri , Farshad Khorrami

Hybrid Action Based Reinforcement Learning for Multi-Objective Compatible Autonomous Driving

Reinforcement Learning (RL) has shown excellent performance in solving decision-making and control problems of autonomous driving, which is increasingly applied in diverse driving scenarios. However, driving is a multi-attribute problem,…

机器人学 · 计算机科学 2026-03-31 Guizhe Jin , Zhuoren Li , Bo Leng , Wei Han , Lu Xiong , Chen Sun

Integrated Shape-Force Estimation for Continuum Robots: A Virtual-Work and Polynomial-Curvature Framework

Cable-driven continuum robots (CDCRs) are widely used in surgical and inspection tasks that require dexterous manipulation in confined spaces. Existing model-based estimation methods either assume constant curvature or rely on…

机器人学 · 计算机科学 2026-03-31 Guoqing Zhang , Zihan Chen , Long Wang

Continual Robot Skill and Task Learning via Dialogue

Interactive robot learning is a challenging problem as the robot is present with human users who expect the robot to learn novel skills to solve novel tasks perpetually with sample efficiency. In this work we present a framework for robots…

机器人学 · 计算机科学 2026-03-31 Weiwei Gu , Suresh Kondepudi , Anmol Gupta , Lixiao Huang , Nakul Gopalan

Mobile Robot Exploration Without Maps via Out-of-Distribution Deep Reinforcement Learning

Autonomous Mobile Robot (AMR) navigation in dynamic environments that may be GPS denied, without a-priori maps, is an unsolved problem with potential to improve humanity's capabilities. Conventional modular methods are computationally…

机器人学 · 计算机科学 2026-03-31 Shathushan Sivashangaran , Apoorva Khairnar , Azim Eskandarian

VLA-OPD: Bridging Offline SFT and Online RL for Vision-Language-Action Models via On-Policy Distillation

Although pre-trained Vision-Language-Action (VLA) models exhibit impressive generalization in robotic manipulation, post-training remains crucial to ensure reliable performance during deployment. However, standard offline Supervised…

机器人学 · 计算机科学 2026-03-30 Zhide Zhong , Haodong Yan , Junfeng Li , Junjie He , Tianran Zhang , Haoang Li

Partial Motion Imitation for Learning Cart Pushing with Legged Manipulators

Loco-manipulation is a key capability for legged robots to perform practical mobile manipulation tasks, such as transporting and pushing objects, in real-world environments. However, learning robust loco-manipulation skills remains…

机器人学 · 计算机科学 2026-03-30 Mili Das , Morgan Byrd , Donghoon Baek , Sehoon Ha

Meta-Adaptive Beam Search Planning for Transformer-Based Reinforcement Learning Control of UAVs with Overhead Manipulators under Flight Disturbances

Drones equipped with overhead manipulators offer unique capabilities for inspection, maintenance, and contact-based interaction. However, the motion of the drone and its manipulator is tightly linked, and even small attitude changes caused…

机器人学 · 计算机科学 2026-03-30 Hazim Alzorgan , Sayed Pedram Haeri Boroujeni , Abolfazl Razi

Addressing Ambiguity in Imitation Learning through Product of Experts based Negative Feedback

Programming robots to perform complex tasks is often difficult and time consuming, requiring expert knowledge and skills in robot software and sometimes hardware. Imitation learning is a method for training robots to perform tasks by…

机器人学 · 计算机科学 2026-03-30 John Bateman , Andy M. Tyrrell , Jihong Zhu

Adapt as You Say: Online Interactive Bimanual Skill Adaptation via Human Language Feedback

Developing general-purpose robots capable of autonomously operating in human living environments requires the ability to adapt to continuously evolving task conditions. However, adapting high-dimensional coordinated bimanual skills to novel…

机器人学 · 计算机科学 2026-03-30 Zhuo Li , Dianxi Li , Tao Teng , Quentin Rouxel , Zhipeng Dong , Dennis Hong , Darwin Caldwell , Fei Chen

DTP-Attack: A decision-based black-box adversarial attack on trajectory prediction

Trajectory prediction systems are critical for autonomous vehicle safety, yet remain vulnerable to adversarial attacks that can cause catastrophic traffic behavior misinterpretations. Existing attack methods require white-box access with…

机器人学 · 计算机科学 2026-03-30 Jiaxiang Li , Jun Yan , Daniel Watzenig , Huilin Yin

120 Minutes and a Laptop: Minimalist Image-goal Navigation via Unsupervised Exploration and Offline RL

The prevailing paradigm for image-goal visual navigation often assumes access to large-scale datasets, substantial pretraining, and significant computational resources. In this work, we challenge this assumption. We show that we can collect…

机器人学 · 计算机科学 2026-03-30 Xiaoming Liu , Borong Zhang , Qingbiao Li , Steven Morad