机器人学 — Scifaro

Machine Learning-based Feedback Linearization Control of Quadrotor Subject to Unmodeled Dynamics

The control of agile quadrotors in dynamic and uncertain environments remains an open area of investigation to this day, particularly when the complete system dynamics are partially known or highly nonlinear. This work introduces a novel…

机器人学 · 计算机科学 2026-06-30 Amos Alwala , Gabriel da Silva Lima , Wallace Moreira Bessa

Diffusion-based 4D Trajectory Prediction and Distributed Control for UAV Swarms

Accurate 4D trajectory prediction and closed-loop tracking are essential for Unmanned Aerial Vehicle (UAV) swarms to achieve safe and efficient operations in complex low-altitude environments such as urban airspaces, industrial sites, and…

机器人学 · 计算机科学 2026-06-30 Tianshun Li , Hongliang Lu , Haoang Li , Xinhu Zheng

MIRTH: Mutual-Information Reasoning with Temporal Hubs for Vision-Language-Action Agents

VLA models have emerged as a powerful paradigm for transferring semantic knowledge from web-scale data to physical robotic control. However, current single-frame architectures suffer from intrinsic limitations: temporal myopia that discards…

机器人学 · 计算机科学 2026-06-30 Hao Sun , Yu Song , Shiyu Teng , Ziwei Niu , Yen-Wei Chen

LLM-Powered Interactive Robotic Action Synthesis from Multimodal Speech, Gestures, and Music

The quest for intuitive and natural human-robot interaction (HRI) remains a significant challenge in robotics. Traditional methods often rely on rigid, pre-programmed commands that limit the robot's expressiveness and adaptability. This…

机器人学 · 计算机科学 2026-06-30 Snehasis Banerjee , Ranjan Dasgupta

A Modular Vision-Language-Action Robotics Framework for Indoor Environments

This paper presents an integrated system for the CMU Vision-Language-Action (VLA) Challenge, designed to enable an autonomous agent to perform complex tasks based on natural language instructions. Our framework employs a modular…

机器人学 · 计算机科学 2026-06-30 Anindya Jana , Snehasis Banerjee , Arup Sadhu , Ranjan Dasgupta

ELASTIC: Efficiently Learning to Adaptively Scale Test-Time Compute for Generative Control Policies

Generative control policies (GCPs), such as diffusion policies and flow-based vision-language-action models, enable test-time scaling in robot control. Test-time compute can be allocated along two axes: sequential scaling, which increases…

机器人学 · 计算机科学 2026-06-30 Andrew Zou Li , Gokul Swamy , Yonatan Bisk , Andrea Bajcsy

What Probing Reveals about Autonomous Driving: Linking Internal Prediction Errors to Ego Planning

Large-scale datasets and fast simulators have enabled improvements in driving policies that appear safe and robust, yet strong performance in nominal scenarios can still mask flawed reasoning and unsafe heuristics. Summary scores from…

机器人学 · 计算机科学 2026-06-30 Hyeonchang Jeon , Kyungbeom Kim , Eugene Vinitsky , Kyung-Joong Kim

Efficient Sim-to-Real Transfer of World-Action Models from Synthetic Priors

Bridging the sim-to-real gap is a core challenge in deploying learned manipulation policies. Sim-to-real learning is attractive because it can replace expensive real robot demonstrations with scalable synthetic data, yet world-action models…

机器人学 · 计算机科学 2026-06-30 Zixing Wang , Kausik Sivakumar , Jinghuan Shang , Yafei Hu , Zhaoming Xie , Ran Gong , Xiaohan Zhang , Karl Schmeckpeper

Ground Plane-Aided Extrinsic Calibration of Inertial and RGB-D Sensors for Uncrewed Aerial Vehicles

Accurate extrinsic calibration of inertial sensors, such as Inertial Measurement Units (IMUs) and cameras is crucial for trajectory estimation of Uncrewed Aerial Vehicles (UAVs). While numerous calibration methods have been proposed, these…

机器人学 · 计算机科学 2026-06-30 Ilyar Asl Sabbaghian Hokmabadi , Mahdis Bisheban

Motion Planning in Compressed Representation Spaces

Deep learning methods have vastly expanded the capabilities of motion planning in robotics applications, as learning priors from large-scale data has been shown to be essential in capturing the highly complex behavior required for solving…

机器人学 · 计算机科学 2026-06-29 Lukas Lao Beyer , Sertac Karaman

Sampling-Based Coordination-Informed Multi-Objective Multi-Robot Reinforcement Learning

Multi-robot systems must simultaneously optimize competing objectives while maintaining coordinated behavior. Existing multi-agent reinforcement learning approaches often rely on fixed or centralized coordination, which limits adaptability…

机器人学 · 计算机科学 2026-06-29 Antonio Marino , Esteban Restrepo , Soon-jo Chung , Paolo Robuffo Giordano , Claudio Pacchierotti

Robustness-Based Synthesis for Time Window Temporal Logic Specifications via Mixed-Integer Linear Programming

Time Window Temporal Logic (TWTL) is a rich specification language for cyber-physical systems that can compactly express sequential tasks with explicit timing constraints. In this paper, we consider the problem of synthesizing control…

机器人学 · 计算机科学 2026-06-29 Philip Smith , Ahmad Ahmad , Kevin Leahy

TAPE: Tether-Aware Path Planning for Autonomous Exploration of Unknown 3D Cavities Using a Tangle-Compatible Tethered Aerial Robot

This letter presents the first method for autonomous exploration of unknown cavities in three dimensions (3D) that focuses on minimizing the distance traveled and the length of tether unwound. Considering that the tether entanglements are…

机器人学 · 计算机科学 2026-06-29 Louis Petit , Alexis Lussier Desbiens

Off the Rails: Hijacking the Scoring Head in Generative End-to-End Driving Planners with Safety-Violating Adversarial Perturbations

Generative models have recently seen rapid adoption in End-to-End (E2E) autonomous driving (AD), with diffusion-based denoising and vocabulary-based retrieval becoming the dominant trajectory-decoding paradigms. Despite their architectural…

机器人学 · 计算机科学 2026-06-29 Halima Bouzidi , Mboutidem Ekemini Mkpong , Haoyu Liu , Mohammad Abdullah Al Faruque

Wind and State Estimation on SE(3): Comparative Evaluation of EKF and UKF with Continuous and Discrete Quadrotor Models

Use of quadrotor UAVs for wind velocity estimation is gaining popularity in recent studies, leveraging their maneuverability, compact size and low cost. Among available approaches, model-based wind velocity estimation is most commonly used,…

机器人学 · 计算机科学 2026-06-29 Hiranya Udagedara , Adam Bigsby , Mahdis Bisheban

From Grasps to Dexterity: Large-Scale Grasp Pretraining for Dexterous Manipulation

Large-scale dexterous grasp datasets encode rich priors over hand-object interaction, but their use has largely been confined to grasp generation and pick-and-place manipulation. We study whether such data can instead support functional…

机器人学 · 计算机科学 2026-06-29 Ying Yuan , Xinyu Liu , Sriram Krishna , David Held

Vision-Language Procedural Reasoning for Context-Aware Reward Modeling of Robotic Endovascular Guidewire Navigation

Robotic-assisted endovascular interventions demand accurate, stable, and context-aware guidewire navigation in complex and patient-specific vascular anatomies. Despite recent advances in robotic precision and learning-based control,…

机器人学 · 计算机科学 2026-06-29 Wentong Tian , Jiyuan Zhao , Tianliang Yao , Yuxiang Fan , Zhengyu Shi , Dong Liu , Peng Qi

ViTL: Temporal Logic-Guided Zero-Shot Natural Language Navigation via Vision-Language Models

Enabling robots to follow natural language commands to complete zero-shot long-horizon tasks remains challenging. It requires extracting implicit temporal and logical constraints from natural language commands and executing multiple…

机器人学 · 计算机科学 2026-06-29 Kaier Liang , Hengde Dai , Cristian-Ioan Vasile

DSIP: A Dynamic Coordination Planner for Signal-Free Intersections using Diffusion-Model-Based Multi-Agent Motion Planning

Traffic signal control at urban intersections inherently introduces stop-and-go behavior, resulting in increased delays and reduced traffic efficiency, especially under high traffic demand. With the emergence of connected and automated…

机器人学 · 计算机科学 2026-06-29 Qian Hu , Haoyang Peng , Songan Zhang , Ming Yang , Hongtei Eric Tseng

VLK: Learning Humanoid Loco-Manipulation from Synthetic Interactions in Reconstructed Scenes

Perception-based humanoid loco-manipulation requires connecting egocentric observations and task instructions to whole-body motion. Learning this mapping requires synchronized egocentric images, language commands, and robot-compatible…

机器人学 · 计算机科学 2026-06-29 Yen-Jen Wang , Jiaman Li , Sirui Chen , Takara E. Truong , Pei Xu , Pieter Abbeel , Rocky Duan , Koushil Sreenath , Angjoo Kanazawa , Carmelo Sferrazza , Guanya Shi , Karen Liu