Related papers: l-dyno: framework to learn consistent visual featu…

Robot Perception enables Complex Navigation Behavior via Self-Supervised Learning

Learning visuomotor control policies in robotic systems is a fundamental problem when aiming for long-term behavioral autonomy. Recent supervised-learning-based vision and motion perception systems, however, are often separately built with…

Robotics · Computer Science 2020-06-17 Marvin Chancán , Michael Milford

Learning Visual Servoing with Deep Features and Fitted Q-Iteration

Visual servoing involves choosing actions that move a robot in response to observations from a camera, in order to reach a goal configuration in the world. Standard visual servoing approaches typically rely on manually designed features and…

Machine Learning · Computer Science 2017-07-12 Alex X. Lee , Sergey Levine , Pieter Abbeel

Estimation with Fast Landmark Selection in Robot Visual Navigation

We consider the visual feature selection to improve the estimation quality required for the accurate navigation of a robot. We build upon a key property that asserts: contributions of trackable features (landmarks) appear linearly in the…

Robotics · Computer Science 2019-02-05 Hossein K. Mousavi , Nader Motee

Enhancing Feature Tracking Reliability for Visual Navigation using Real-Time Safety Filter

Vision sensors are extensively used for localizing a robot's pose, particularly in environments where global localization tools such as GPS or motion capture systems are unavailable. In many visual navigation systems, localization is…

Robotics · Computer Science 2025-02-04 Dabin Kim , Inkyu Jang , Youngsoo Han , Sunwoo Hwang , H. Jin Kim

DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control

Imitation learning has proven to be a powerful tool for training complex visuomotor policies. However, current methods often require hundreds to thousands of expert demonstrations to handle high-dimensional visual observations. A key reason…

Robotics · Computer Science 2024-11-01 Zichen Jeff Cui , Hengkai Pan , Aadhithya Iyer , Siddhant Haldar , Lerrel Pinto

Statistical Uncertainty Learning for Robust Visual-Inertial State Estimation

A fundamental challenge in robust visual-inertial odometry (VIO) is to dynamically assess the reliability of sensor measurements. This assessment is crucial for properly weighting the contribution of each measurement to the state estimate.…

Robotics · Computer Science 2025-10-03 Seungwon Choi , Donggyu Park , Seo-Yeon Hwang , Tae-Wan Kim

Robotic Applications of Pre-Trained Vision-Language Models to Various Recognition Behaviors

In recent years, a number of models that learn the relations between vision and language from large datasets have been released. These models perform a variety of tasks, such as answering questions about images, retrieving sentences that…

Robotics · Computer Science 2024-03-19 Kento Kawaharazuka , Yoshiki Obinata , Naoaki Kanazawa , Kei Okada , Masayuki Inaba

Sample Efficient Dynamics Learning for Symmetrical Legged Robots:Leveraging Physics Invariance and Geometric Symmetries

Model generalization of the underlying dynamics is critical for achieving data efficiency when learning for robot control. This paper proposes a novel approach for learning dynamics leveraging the symmetry in the underlying robotic system,…

Robotics · Computer Science 2022-10-17 Jee-eun Lee , Jaemin Lee , Tirthankar Bandyopadhyay , Luis Sentis

RoRD: Rotation-Robust Descriptors and Orthographic Views for Local Feature Matching

The use of local detectors and descriptors in typical computer vision pipelines work well until variations in viewpoint and appearance change become extreme. Past research in this area has typically focused on one of two approaches to this…

Computer Vision and Pattern Recognition · Computer Science 2022-03-25 Udit Singh Parihar , Aniket Gujarathi , Kinal Mehta , Satyajit Tourani , Sourav Garg , Michael Milford , K. Madhava Krishna

Efficient Greedy Algorithms for Feature Selection in Robot Visual Localization

Robot localization is a fundamental component of autonomous navigation in unknown environments. Among various sensing modalities, visual input from cameras plays a central role, enabling robots to estimate their position by tracking point…

Robotics · Computer Science 2025-11-27 Vivek Pandey , Amirhossein Mollaei , Nader Motee

Latent Object Characteristics Recognition with Visual to Haptic-Audio Cross-modal Transfer Learning

Recognising the characteristics of objects while a robot handles them is crucial for adjusting motions that ensure stable and efficient interactions with containers. Ahead of realising stable and efficient robot motions for…

Robotics · Computer Science 2024-03-19 Namiko Saito , Joao Moura , Hiroki Uchida , Sethu Vijayakumar

Visual Object Recognition in Indoor Environments Using Topologically Persistent Features

Object recognition in unseen indoor environments remains a challenging problem for visual perception of mobile robots. In this letter, we propose the use of topologically persistent features, which rely on the objects' shape information, to…

Computer Vision and Pattern Recognition · Computer Science 2021-07-30 Ekta U. Samani , Xingjian Yang , Ashis G. Banerjee

What to Learn: Features, Image Transformations, or Both?

Long-term visual localization is an essential problem in robotics and computer vision, but remains challenging due to the environmental appearance changes caused by lighting and seasons. While many existing works have attempted to solve it…

Robotics · Computer Science 2023-06-23 Yuxuan Chen , Binbin Xu , Frederike Dümbgen , Timothy D. Barfoot

Simultaneous View and Feature Selection for Collaborative Multi-Robot Perception

Collaborative multi-robot perception provides multiple views of an environment, offering varying perspectives to collaboratively understand the environment even when individual robots have poor points of view or when occlusions are caused…

Robotics · Computer Science 2021-03-09 Brian Reily , Hao Zhang

Learning Appearance and Motion Cues for Panoptic Tracking

Panoptic tracking enables pixel-level scene interpretation of videos by integrating instance tracking in panoptic segmentation. This provides robots with a spatio-temporal understanding of the environment, an essential attribute for their…

Computer Vision and Pattern Recognition · Computer Science 2025-03-13 Juana Valeria Hurtado , Sajad Marvi , Rohit Mohan , Abhinav Valada

Learning-based Relational Object Matching Across Views

Intelligent robots require object-level scene understanding to reason about possible tasks and interactions with the environment. Moreover, many perception tasks such as scene reconstruction, image retrieval, or place recognition can…

Computer Vision and Pattern Recognition · Computer Science 2023-05-05 Cathrin Elich , Iro Armeni , Martin R. Oswald , Marc Pollefeys , Joerg Stueckler

Sparse Representations for Object and Ego-motion Estimation in Dynamic Scenes

Dynamic scenes that contain both object motion and egomotion are a challenge for monocular visual odometry (VO). Another issue with monocular VO is the scale ambiguity, i.e. these methods cannot estimate scene depth and camera motion in…

Computer Vision and Pattern Recognition · Computer Science 2020-08-31 Hirak J Kashyap , Charless Fowlkes , Jeffrey L Krichmar

Robust Ego and Object 6-DoF Motion Estimation and Tracking

The problem of tracking self-motion as well as motion of objects in the scene using information from a camera is known as multi-body visual odometry and is a challenging task. This paper proposes a robust solution to achieve accurate…

Robotics · Computer Science 2020-07-29 Jun Zhang , Mina Henein , Robert Mahony , Viorela Ila

Local Supports Global: Deep Camera Relocalization with Sequence Enhancement

We propose to leverage the local information in image sequences to support global camera relocalization. In contrast to previous methods that regress global poses from single images, we exploit the spatial-temporal consistency in sequential…

Computer Vision and Pattern Recognition · Computer Science 2019-08-14 Fei Xue , Xin Wang , Zike Yan , Qiuyuan Wang , Junqiu Wang , Hongbin Zha

A Vision-Guided Multi-Robot Cooperation Framework for Learning-by-Demonstration and Task Reproduction

This paper presents a vision-based learning-by-demonstration approach to enable robots to learn and complete a manipulation task cooperatively. With this method, a vision system is involved in both the task demonstration and reproduction…

Robotics · Computer Science 2017-06-05 Bidan Huang , Menglong Ye , Su-Lin Lee , Guang-Zhong Yang