Related papers: ViewActive: Active viewpoint optimization from a s…

A General One-Shot Multimodal Active Perception Framework for Robotic Manipulation: Learning to Predict Optimal Viewpoint

Active perception in vision-based robotic manipulation aims to move the camera toward more informative observation viewpoints, thereby providing high-quality perceptual inputs for downstream tasks. Most existing active perception methods…

Robotics · Computer Science 2026-01-21 Deyun Qin , Zezhi Liu , Hanqian Luo , Xiao Liang , Yongchun Fang

Active Perception and Representation for Robotic Manipulation

The vast majority of visual animals actively control their eyes, heads, and/or bodies to direct their gaze toward different parts of their environment. In contrast, recent applications of reinforcement learning in robotic manipulation…

Computer Vision and Pattern Recognition · Computer Science 2020-03-17 Youssef Zaky , Gaurav Paruthi , Bryan Tripp , James Bergstra

Integrating Three Mechanisms of Visual Attention for Active Visual Search

Algorithms for robotic visual search can benefit from the use of visual attention methods in order to reduce computational costs. Here, we describe how three distinct mechanisms of visual attention can be integrated and productively used to…

Computer Vision and Pattern Recognition · Computer Science 2018-05-31 Amir Rasouli , John K. Tsotsos

Improving Viewpoint-Independent Object-Centric Representations through Active Viewpoint Selection

Given the complexities inherent in visual scenes, such as object occlusion, a comprehensive understanding often requires observation from multiple viewpoints. Existing multi-viewpoint object-centric learning methods typically employ random…

Computer Vision and Pattern Recognition · Computer Science 2024-11-04 Yinxuan Huang , Chengmin Gao , Bin Li , Xiangyang Xue

BOSfM: A View Planning Framework for Optimal 3D Reconstruction of Agricultural Scenes

Active vision (AV) has been in the spotlight of robotics research due to its emergence in numerous applications including agricultural tasks such as precision crop monitoring and autonomous harvesting to list a few. A major AV problem that…

Robotics · Computer Science 2025-09-30 Athanasios Bacharis , Konstantinos D. Polyzos , Georgios B. Giannakis , Nikolaos Papanikolopoulos

Active Human Pose Estimation via an Autonomous UAV Agent

One of the core activities of an active observer involves moving to secure a "better" view of the scene, where the definition of "better" is task-dependent. This paper focuses on the task of human pose estimation from videos capturing a…

Robotics · Computer Science 2024-07-03 Jingxi Chen , Botao He , Chahat Deep Singh , Cornelia Fermuller , Yiannis Aloimonos

Attention-based Active Visual Search for Mobile Robots

We present an active visual search model for finding objects in unknown environments. The proposed algorithm guides the robot towards the sought object using the relevant stimuli provided by the visual sensors. Existing search strategies…

Robotics · Computer Science 2021-02-08 Amir Rasouli , Pablo Lanillos , Gordon Cheng , John K. Tsotsos

Coverage Optimization for Camera View Selection

What makes a good viewpoint? The quality of the data used to learn 3D reconstructions is crucial for enabling efficient and accurate scene modeling. We study the active view selection problem and develop a principled analysis that yields a…

Computer Vision and Pattern Recognition · Computer Science 2026-04-08 Timothy Chen , Adam Dai , Maximilian Adang , Grace Gao , Mac Schwager

Optimizing Active Perception for Learning Simultaneous Viewpoint Selection and Manipulation with Diffusion Policy

Robotic manipulation tasks often rely on static cameras for perception, which can limit flexibility, particularly in scenarios like robotic surgery and cluttered environments where mounting static cameras is impractical. Ideally, robots…

Robotics · Computer Science 2025-09-18 Xiatao Sun , Francis Fan , Yinxing Chen , Daniel Rakita

Preference-Driven Active 3D Scene Representation for Robotic Inspection in Nuclear Decommissioning

Active 3D scene representation is pivotal in modern robotics applications, including remote inspection, manipulation, and telepresence. Traditional methods primarily optimize geometric fidelity or rendering accuracy, but often overlook…

Robotics · Computer Science 2025-04-04 Zhen Meng , Kan Chen , Xiangmin Xu , Erwin Jose Lopez Pulgarin , Emma Li , Philip G. Zhao , David Flynn

Learning Where to Look: Self-supervised Viewpoint Selection for Active Localization using Geometrical Information

Accurate localization in diverse environments is a fundamental challenge in computer vision and robotics. The task involves determining a sensor's precise position and orientation, typically a camera, within a given space. Traditional…

Computer Vision and Pattern Recognition · Computer Science 2024-07-23 Luca Di Giammarino , Boyang Sun , Giorgio Grisetti , Marc Pollefeys , Hermann Blum , Daniel Barath

Peering into the Unknown: Active View Selection with Neural Uncertainty Maps for 3D Reconstruction

Some perspectives naturally provide more information than others. How can an AI system determine which viewpoint offers the most valuable insight for accurate and efficient 3D object reconstruction? Active view selection (AVS) for 3D…

Computer Vision and Pattern Recognition · Computer Science 2026-02-25 Zhengquan Zhang , Feng Xu , Mengmi Zhang

ViewFool: Evaluating the Robustness of Visual Recognition to Adversarial Viewpoints

Recent studies have demonstrated that visual recognition models lack robustness to distribution shift. However, current work mainly considers model robustness to 2D image transformations, leaving viewpoint changes in the 3D world less…

Computer Vision and Pattern Recognition · Computer Science 2022-10-11 Yinpeng Dong , Shouwei Ruan , Hang Su , Caixin Kang , Xingxing Wei , Jun Zhu

Active Visuo-Tactile Interactive Robotic Perception for Accurate Object Pose Estimation in Dense Clutter

This work presents a novel active visuo-tactile based framework for robotic systems to accurately estimate pose of objects in dense cluttered environments. The scene representation is derived using a novel declutter graph (DG) which…

Robotics · Computer Science 2022-02-14 Prajval Kumar Murali , Anirvan Dutta , Michael Gentner , Etienne Burdet , Ravinder Dahiya , Mohsen Kaboli

Automatic Robot Path Planning for Visual Inspection from Object Shape

Visual inspection is a crucial yet time-consuming task across various industries. Numerous established methods employ machine learning in inspection tasks, necessitating specific training data that includes predefined inspection poses and…

Robotics · Computer Science 2023-12-06 O. Tasneem , R. Pieters

ViewNeRF: Unsupervised Viewpoint Estimation Using Category-Level Neural Radiance Fields

We introduce ViewNeRF, a Neural Radiance Field-based viewpoint estimation method that learns to predict category-level viewpoints directly from images during training. While NeRF is usually trained with ground-truth camera poses, multiple…

Computer Vision and Pattern Recognition · Computer Science 2022-12-02 Octave Mariotti , Oisin Mac Aodha , Hakan Bilen

AVR: Active Vision-Driven Precise Robot Manipulation with Viewpoint and Focal Length Optimization

Robotic manipulation in complex scenes demands precise perception of task-relevant details, yet fixed or suboptimal viewpoints often impair fine-grained perception and induce occlusions, constraining imitation-learned policies. We present…

Robotics · Computer Science 2025-09-29 Yushan Liu , Shilong Mu , Xintao Chao , Zizhen Li , Yao Mu , Tianxing Chen , Shoujie Li , Chuqiao Lyu , Xiao-Ping Zhang , Wenbo Ding

Multi-View Adaptive Fusion Network for 3D Object Detection

3D object detection based on LiDAR-camera fusion is becoming an emerging research theme for autonomous driving. However, it has been surprisingly difficult to effectively fuse both modalities without information loss and interference. To…

Computer Vision and Pattern Recognition · Computer Science 2020-12-09 Guojun Wang , Bin Tian , Yachen Zhang , Long Chen , Dongpu Cao , Jian Wu

CVFNet: Real-time 3D Object Detection by Learning Cross View Features

In recent years 3D object detection from LiDAR point clouds has made great progress thanks to the development of deep learning technologies. Although voxel or point based methods are popular in 3D object detection, they usually involve…

Computer Vision and Pattern Recognition · Computer Science 2022-07-18 Jiaqi Gu , Zhiyu Xiang , Pan Zhao , Tingming Bai , Lingxuan Wang , Xijun Zhao , Zhiyuan Zhang

Leveraging Spatial Attention and Edge Context for Optimized Feature Selection in Visual Localization

Visual localization determines an agent's precise position and orientation within an environment using visual data. It has become a critical task in the field of robotics, particularly in applications such as autonomous navigation. This is…

Computer Vision and Pattern Recognition · Computer Science 2025-02-25 Nanda Febri Istighfarin , HyungGi Jo