Related papers: SimVODIS: Simultaneous Visual Odometry, Object Det…

SPORTS: Simultaneous Panoptic Odometry, Rendering, Tracking and Segmentation for Urban Scenes Understanding

The scene perception, understanding, and simulation are fundamental techniques for embodied-AI agents, while existing solutions are still prone to segmentation deficiency, dynamic objects' interference, sensor data sparsity, and…

Computer Vision and Pattern Recognition · Computer Science 2025-10-15 Zhiliu Yang , Jinyu Dai , Jianyuan Zhang , Zhu Yang

Embodied Visual Active Learning for Semantic Segmentation

We study the task of embodied visual active learning, where an agent is set to explore a 3d environment with the goal to acquire visual scene understanding by actively selecting views for which to request annotation. While accurate on some…

Computer Vision and Pattern Recognition · Computer Science 2020-12-18 David Nilsson , Aleksis Pirinen , Erik Gärtner , Cristian Sminchisescu

Multi-Agent Embodied Visual Semantic Navigation with Scene Prior Knowledge

In visual semantic navigation, the robot navigates to a target object with egocentric visual observations and the class label of the target is given. It is a meaningful task inspiring a surge of relevant research. However, most of the…

Artificial Intelligence · Computer Science 2021-09-21 Xinzhu Liu , Di Guo , Huaping Liu , Fuchun Sun

Simultaneous Object Detection and Semantic Segmentation

Both object detection in and semantic segmentation of camera images are important tasks for automated vehicles. Object detection is necessary so that the planning and behavior modules can reason about other road users. Semantic segmentation…

Computer Vision and Pattern Recognition · Computer Science 2020-02-14 Niels Ole Salscheider

Dynamic Objects Segmentation for Visual Localization in Urban Environments

Visual localization and mapping is a crucial capability to address many challenges in mobile robotics. It constitutes a robust, accurate and cost-effective approach for local and global pose estimation within prior maps. Yet, in highly…

Computer Vision and Pattern Recognition · Computer Science 2018-07-11 Guoxiang Zhou , Berta Bescos , Marcin Dymczyk , Mark Pfeiffer , José Neira , Roland Siegwart

Siamese Network with Interactive Transformer for Video Object Segmentation

Semi-supervised video object segmentation (VOS) refers to segmenting the target object in remaining frames given its annotation in the first frame, which has been actively studied in recent years. The key challenge lies in finding effective…

Computer Vision and Pattern Recognition · Computer Science 2021-12-30 Meng Lan , Jing Zhang , Fengxiang He , Lefei Zhang

Object Manipulation via Visual Target Localization

Object manipulation is a critical skill required for Embodied AI agents interacting with the world around them. Training agents to manipulate objects, poses many challenges. These include occlusion of the target object by the agent's arm,…

Computer Vision and Pattern Recognition · Computer Science 2022-03-16 Kiana Ehsani , Ali Farhadi , Aniruddha Kembhavi , Roozbeh Mottaghi

A2VIS: Amodal-Aware Approach to Video Instance Segmentation

Handling occlusion remains a significant challenge for video instance-level tasks like Multiple Object Tracking (MOT) and Video Instance Segmentation (VIS). In this paper, we propose a novel framework, Amodal-Aware Video Instance…

Computer Vision and Pattern Recognition · Computer Science 2025-04-11 Minh Tran , Thang Pham , Winston Bounsavy , Tri Nguyen , Ngan Le

DeepVO: A Deep Learning approach for Monocular Visual Odometry

Deep Learning based techniques have been adopted with precision to solve a lot of standard computer vision problems, some of which are image classification, object detection and segmentation. Despite the widespread success of these…

Computer Vision and Pattern Recognition · Computer Science 2016-11-21 Vikram Mohanty , Shubh Agrawal , Shaswat Datta , Arna Ghosh , Vishnu Dutt Sharma , Debashish Chakravarty

Multimotion Visual Odometry (MVO): Simultaneous Estimation of Camera and Third-Party Motions

Estimating motion from images is a well-studied problem in computer vision and robotics. Previous work has developed techniques to estimate the motion of a moving camera in a largely static environment (e.g., visual odometry) and to segment…

Robotics · Computer Science 2019-03-01 Kevin M. Judd , Jonathan D. Gammell , Paul Newman

Automatic Segmentation of Dynamic Objects from an Image Pair

Automatic segmentation of objects from a single image is a challenging problem which generally requires training on large number of images. We consider the problem of automatically segmenting only the dynamic objects from a given pair of…

Computer Vision and Pattern Recognition · Computer Science 2016-04-19 Sri Raghu Malireddi , Shanmuganathan Raman

Video Object Segmentation with Dynamic Query Modulation

Storing intermediate frame segmentations as memory for long-range context modeling, spatial-temporal memory-based methods have recently showcased impressive results in semi-supervised video object segmentation (SVOS). However, these methods…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Hantao Zhou , Runze Hu , Xiu Li

Modality-invariant Visual Odometry for Embodied Vision

Effectively localizing an agent in a realistic, noisy setting is crucial for many embodied vision tasks. Visual Odometry (VO) is a practical substitute for unreliable GPS and compass sensors, especially in indoor environments. While…

Computer Vision and Pattern Recognition · Computer Science 2023-05-02 Marius Memmel , Roman Bachmann , Amir Zamir

VIODE: A Simulated Dataset to Address the Challenges of Visual-Inertial Odometry in Dynamic Environments

Dynamic environments such as urban areas are still challenging for popular visual-inertial odometry (VIO) algorithms. Existing datasets typically fail to capture the dynamic nature of these environments, therefore making it difficult to…

Robotics · Computer Science 2021-02-12 Koji Minoda , Fabian Schilling , Valentin Wüest , Dario Floreano , Takehisa Yairi

Towards Deeply Unified Depth-aware Panoptic Segmentation with Bi-directional Guidance Learning

Depth-aware panoptic segmentation is an emerging topic in computer vision which combines semantic and geometric understanding for more robust scene interpretation. Recent works pursue unified frameworks to tackle this challenge but mostly…

Computer Vision and Pattern Recognition · Computer Science 2023-08-15 Junwen He , Yifan Wang , Lijun Wang , Huchuan Lu , Jun-Yan He , Jin-Peng Lan , Bin Luo , Yifeng Geng , Xuansong Xie

Look, Listen, and Act: Towards Audio-Visual Embodied Navigation

A crucial ability of mobile intelligent agents is to integrate the evidence from multiple sensory inputs in an environment and to make a sequence of actions to reach their goals. In this paper, we attempt to approach the problem of…

Computer Vision and Pattern Recognition · Computer Science 2020-03-10 Chuang Gan , Yiwei Zhang , Jiajun Wu , Boqing Gong , Joshua B. Tenenbaum

OVSNet : Towards One-Pass Real-Time Video Object Segmentation

Video object segmentation aims at accurately segmenting the target object regions across consecutive frames. It is technically challenging for coping with complicated factors (e.g., shape deformations, occlusion and out of the lens). Recent…

Computer Vision and Pattern Recognition · Computer Science 2019-07-03 Peng Sun , Peiwen Lin , Guangliang Cheng , Jianping Shi , Jiawan Zhang , Xi Li

EdgeVO: An Efficient and Accurate Edge-based Visual Odometry

Visual odometry is important for plenty of applications such as autonomous vehicles, and robot navigation. It is challenging to conduct visual odometry in textureless scenes or environments with sudden illumination changes where popular…

Robotics · Computer Science 2023-02-21 Hui Zhao , Jianga Shang , Kai Liu , Chao Chen , Fuqiang Gu

ADUGS-VINS: Generalized Visual-Inertial Odometry for Robust Navigation in Highly Dynamic and Complex Environments

Visual-inertial odometry (VIO) is widely used in various fields, such as robots, drones, and autonomous vehicles. However, real-world scenes often feature dynamic objects, compromising the accuracy of VIO. The diversity and partial…

Computer Vision and Pattern Recognition · Computer Science 2025-03-04 Rui Zhou , Jingbin Liu , Junbin Xie , Jianyu Zhang , Yingze Hu , Jiele Zhao

AIMS: All-Inclusive Multi-Level Segmentation

Despite the progress of image segmentation for accurate visual entity segmentation, completing the diverse requirements of image editing applications for different-level region-of-interest selections remains unsolved. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2023-05-30 Lu Qi , Jason Kuen , Weidong Guo , Jiuxiang Gu , Zhe Lin , Bo Du , Yu Xu , Ming-Hsuan Yang