Related papers: Learning Camera Performance Models for Active Mult…

Monocular Visual Teach and Repeat Aided by Local Ground Planarity

Visual Teach and Repeat (VT\&R) allows an autonomous vehicle to repeat a previously traversed route without a global positioning system. Existing implementations of VT\&R typically rely on 3D sensors such as stereo cameras for mapping and…

Robotics · Computer Science 2019-08-08 Lee Clement , Jonathan Kelly , Timothy D. Barfoot

Robust Visual Teach and Repeat for UGVs Using 3D Semantic Maps

We propose a Visual Teach and Repeat (VTR) algorithm using semantic landmarks extracted from environmental objects for ground robots with fixed mount monocular cameras. The proposed algorithm is robust to changes in the starting pose of the…

Robotics · Computer Science 2022-06-28 Mohammad Mahdavian , KangKang Yin , Mo Chen

Event-Based Visual Teach-and-Repeat via Fast Fourier-Domain Cross-Correlation

Visual teach-and-repeat (VT&R) navigation enables robots to autonomously traverse previously demonstrated paths using visual feedback. We present a novel event-camera-based VT\&R system. Our system formulates event-stream matching as…

Robotics · Computer Science 2026-03-10 Gokul B. Nair , Alejandro Fontan , Michael Milford , Tobias Fischer

Robust Visual Teach-and-Repeat Navigation with Flexible Topo-metric Graph Map Representation

Visual Teach-and-Repeat Navigation is a direct solution for mobile robot to be deployed in unknown environments. However, robust trajectory repeat navigation still remains challenged due to environmental changing and dynamic objects. In…

Robotics · Computer Science 2025-10-13 Jikai Wang , Yunqi Cheng , Kezhi Wang , Zonghai Chen

An Efficient Locally Reactive Controller for Safe Navigation in Visual Teach and Repeat Missions

To achieve successful field autonomy, mobile robots need to freely adapt to changes in their environment. Visual navigation systems such as Visual Teach and Repeat (VT&R) often assume the space around the reference trajectory is free, but…

Robotics · Computer Science 2022-07-01 Matías Mattamala , Nived Chebrolu , Maurice Fallon

FLAF: Focal Line and Feature-constrained Active View Planning for Visual Teach and Repeat

This paper presents FLAF, a focal line and feature-constrained active view planning method for tracking failure avoidance in feature-based visual navigation of mobile robots. Our FLAF-based visual navigation is built upon a feature-based…

Robotics · Computer Science 2025-07-16 Changfei Fu , Weinan Chen , Wenjun Xu , Hong Zhang

Navigation without localisation: reliable teach and repeat based on the convergence theorem

We present a novel concept for teach-and-repeat visual navigation. The proposed concept is based on a mathematical model, which indicates that in teach-and-repeat navigation scenarios, mobile robots do not need to perform explicit…

Robotics · Computer Science 2018-08-01 Tomas Krajnik , Filip Majer , Lucie Halodova , Tomas Vintr

Multi-cam Multi-map Visual Inertial Localization: System, Validation and Dataset

Robot control loops require causal pose estimates that depend only on past and present measurements. At each timestep, controllers compute commands using the current pose without waiting for future refinements. While traditional visual SLAM…

Robotics · Computer Science 2025-11-11 Yufei Wei , Fuzhang Han , Yanmei Jiao , Zhuqing Zhang , Yiyuan Pan , Wenjun Huang , Li Tang , Huan Yin , Xiaqing Ding , Chenxiao Hu , Rong Xiong , Yue Wang

Multi-Platform Teach-and-Repeat Navigation by Visual Place Recognition Based on Deep-Learned Local Features

Uniform and variable environments still remain a challenge for stable visual localization and mapping in mobile robot navigation. One of the possible approaches suitable for such environments is appearance-based teach-and-repeat navigation,…

Robotics · Computer Science 2025-03-18 Václav Truhlařík , Tomáš Pivoňka , Michal Kasarda , Libor Přeučil

Learning Precise 3D Manipulation from Multiple Uncalibrated Cameras

In this work, we present an effective multi-view approach to closed-loop end-to-end learning of precise manipulation tasks that are 3D in nature. Our method learns to accomplish these tasks using multiple statically placed but uncalibrated…

Robotics · Computer Science 2021-04-02 Iretiayo Akinola , Jacob Varley , Dmitry Kalashnikov

Bidirectional Multirate Reconstruction for Temporal Modeling in Videos

Despite the recent success of neural networks in image feature learning, a major problem in the video domain is the lack of sufficient labeled data for learning to model temporal information. In this paper, we propose an unsupervised…

Computer Vision and Pattern Recognition · Computer Science 2016-11-29 Linchao Zhu , Zhongwen Xu , Yi Yang

Toward Scalable Co-located Practical Learning: Assisting with Computer Vision and Multimodal Analytics

This study examined whether a single ceiling-mounted camera could be used to capture fine-grained learning behaviours in co-located practical learning. In undergraduate nursing simulations, teachers first identified seven observable…

Human-Computer Interaction · Computer Science 2026-03-17 Xinyu Li , Linxuan Zhao , Roberto Martinez-Maldonado , Dragan Gasevic , Lixiang Yan

A Case Study on Visual-Audio-Tactile Cross-Modal Retrieval

Cross-Modal Retrieval (CMR), which retrieves relevant items from one modality (e.g., audio) given a query in another modality (e.g., visual), has undergone significant advancements in recent years. This capability is crucial for robots to…

Robotics · Computer Science 2024-07-31 Jagoda Wojcik , Jiaqi Jiang , Jiacheng Wu , Shan Luo

Visual Teach and Repeat 3 (VT&R3), a generalization of stereo VT&R, achieves long-term autonomous path-following using topometric mapping and localization from a single rich sensor stream. In this paper, we improve the capabilities of a…

Robotics · Computer Science 2022-11-04 Jordy Sehn , Yuchen Wu , Timothy D. Barfoot

VIR-SLAM: Visual, Inertial, and Ranging SLAM for single and multi-robot systems

Monocular cameras coupled with inertial measurements generally give high performance visual inertial odometry. However, drift can be significant with long trajectories, especially when the environment is visually challenging. In this paper,…

Robotics · Computer Science 2020-06-02 Yanjun Cao , Giovanni Beltrame

Hierarchical place recognition with omnidirectional images and curriculum learning-based loss functions

This paper addresses Visual Place Recognition (VPR), which is essential for the safe navigation of mobile robots. The solution we propose employs panoramic images and deep learning models, which are fine-tuned with triplet loss functions…

Robotics · Computer Science 2025-10-03 Marcos Alfaro , Juan José Cabrera , María Flores , Óscar Reinoso , Luis Payá

Multi-Camera View Scaling for Data-Efficient Robot Imitation Learning

The generalization ability of imitation learning policies for robotic manipulation is fundamentally constrained by the diversity of expert demonstrations, while collecting demonstrations across varied environments is costly and difficult in…

Robotics · Computer Science 2026-04-02 Yichen Xie , Yixiao Wang , Shuqi Zhao , Cheng-En Wu , Masayoshi Tomizuka , Jianwen Xie , Hao-Shu Fang

DeepMEL: Compiling Visual Multi-Experience Localization into a Deep Neural Network

Vision-based path following allows robots to autonomously repeat manually taught paths. Stereo Visual Teach and Repeat (VT\&R) accomplishes accurate and robust long-range path following in unstructured outdoor environments across changing…

Robotics · Computer Science 2020-03-09 Mona Gridseth , Timothy D. Barfoot

Active Visual SLAM with Independently Rotating Camera

In active Visual-SLAM (V-SLAM), a robot relies on the information retrieved by its cameras to control its own movements for autonomous mapping of the environment. Cameras are usually statically linked to the robot's body, limiting the extra…

Robotics · Computer Science 2021-11-03 Elia Bonetto , Pascal Goldschmid , Michael J. Black , Aamir Ahmad

MV-TAP: Tracking Any Point in Multi-View Videos

Multi-view camera systems enable rich observations of complex real-world scenes, and understanding dynamic objects in multi-view settings has become central to various applications. In this work, we present MV-TAP, a novel point tracker…

Computer Vision and Pattern Recognition · Computer Science 2025-12-02 Jahyeok Koo , Inès Hyeonsu Kim , Mungyeom Kim , Junghyun Park , Seohyun Park , Jaeyeong Kim , Jung Yi , Seokju Cho , Seungryong Kim