Related papers: A Dataset for Developing and Benchmarking Active V…

Recognizing Objects In-the-wild: Where Do We Stand?

The ability to recognize objects is an essential skill for a robotic system acting in human-populated environments. Despite decades of effort from the robotic and vision research communities, robots are still missing good visual perceptual…

Robotics · Computer Science 2018-05-23 Mohammad Reza Loghmani , Barbara Caputo , Markus Vincze

Active Object Localization with Deep Reinforcement Learning

We present an active detection model for localizing objects in scenes. The model is class-specific and allows an agent to focus attention on candidate regions for identifying the correct location of a target object. This agent learns to…

Computer Vision and Pattern Recognition · Computer Science 2015-11-20 Juan C. Caicedo , Svetlana Lazebnik

Real-World Image Datasets for Federated Learning

Federated learning is a new machine learning paradigm which allows data parties to build machine learning models collaboratively while keeping their data secure and private. While research efforts on federated learning have been growing…

Computer Vision and Pattern Recognition · Computer Science 2021-01-06 Jiahuan Luo , Xueyang Wu , Yun Luo , Anbu Huang , Yunfeng Huang , Yang Liu , Qiang Yang

ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes

A key requirement for leveraging supervised deep learning methods is the availability of large, labeled datasets. Unfortunately, in the context of RGB-D scene understanding, very little data is available -- current datasets cover a small…

Computer Vision and Pattern Recognition · Computer Science 2017-04-12 Angela Dai , Angel X. Chang , Manolis Savva , Maciej Halber , Thomas Funkhouser , Matthias Nießner

HabitatDyn Dataset: Dynamic Object Detection to Kinematics Estimation

The advancement of computer vision and machine learning has made datasets a crucial element for further research and applications. However, the creation and development of robots with advanced recognition capabilities are hindered by the…

Computer Vision and Pattern Recognition · Computer Science 2023-04-24 Zhengcheng Shen , Yi Gao , Linh Kästner , Jens Lambrecht

Multiview RGB-D Dataset for Object Instance Detection

This paper presents a new multi-view RGB-D dataset of nine kitchen scenes, each containing several objects in realistic cluttered environments including a subset of objects from the BigBird dataset. The viewpoints of the scenes are densely…

Computer Vision and Pattern Recognition · Computer Science 2016-09-27 Georgios Georgakis , Md Alimoor Reza , Arsalan Mousavian , Phi-Hung Le , Jana Kosecka

GraspNet: A Large-Scale Clustered and Densely Annotated Dataset for Object Grasping

Object grasping is critical for many applications, which is also a challenging computer vision problem. However, for the clustered scene, current researches suffer from the problems of insufficient training data and the lacking of…

Computer Vision and Pattern Recognition · Computer Science 2020-01-03 Hao-Shu Fang , Chenxi Wang , Minghao Gou , Cewu Lu

Dataset for Robust and Accurate Leading Vehicle Velocity Recognition

Recognition of the surrounding environment using a camera is an important technology in Advanced Driver-Assistance Systems and Autonomous Driving, and recognition technology is often solved by machine learning approaches such as deep…

Computer Vision and Pattern Recognition · Computer Science 2022-04-28 Genya Ogawa , Toru Saito , Noriyuki Aoi

360-Indoor: Towards Learning Real-World Objects in 360{\deg} Indoor Equirectangular Images

While there are several widely used object detection datasets, current computer vision algorithms are still limited in conventional images. Such images narrow our vision in a restricted region. On the other hand, 360{\deg} images provide a…

Computer Vision and Pattern Recognition · Computer Science 2019-10-07 Shih-Han Chou , Cheng Sun , Wen-Yen Chang , Wan-Ting Hsu , Min Sun , Jianlong Fu

Robot Active Neural Sensing and Planning in Unknown Cluttered Environments

Active sensing and planning in unknown, cluttered environments is an open challenge for robots intending to provide home service, search and rescue, narrow-passage inspection, and medical assistance. Although many active sensing methods…

Robotics · Computer Science 2022-08-25 Hanwen Ren , Ahmed H. Qureshi

OpenLORIS-Object: A Robotic Vision Dataset and Benchmark for Lifelong Deep Learning

The recent breakthroughs in computer vision have benefited from the availability of large representative datasets (e.g. ImageNet and COCO) for training. Yet, robotic vision poses unique challenges for applying visual algorithms developed…

Computer Vision and Pattern Recognition · Computer Science 2020-03-09 Qi She , Fan Feng , Xinyue Hao , Qihan Yang , Chuanlin Lan , Vincenzo Lomonaco , Xuesong Shi , Zhengwei Wang , Yao Guo , Yimin Zhang , Fei Qiao , Rosa H. M. Chan

Kinematically-Informed Interactive Perception: Robot-Generated 3D Models for Classification

To be useful in everyday environments, robots must be able to observe and learn about objects. Recent datasets enable progress for classifying data into known object categories; however, it is unclear how to collect reliable object data…

Robotics · Computer Science 2019-01-18 Abhishek Venkataraman , Brent Griffin , Jason J. Corso

A large scale multi-view RGBD visual affordance learning dataset

The physical and textural attributes of objects have been widely studied for recognition, detection and segmentation tasks in computer vision.~A number of datasets, such as large scale ImageNet, have been proposed for feature learning using…

Computer Vision and Pattern Recognition · Computer Science 2023-09-14 Zeyad Khalifa , Syed Afaq Ali Shah

RGBD Datasets: Past, Present and Future

Since the launch of the Microsoft Kinect, scores of RGBD datasets have been released. These have propelled advances in areas from reconstruction to gesture recognition. In this paper we explore the field, reviewing datasets across eight…

Computer Vision and Pattern Recognition · Computer Science 2016-04-14 Michael Firman

THUD++: Large-Scale Dynamic Indoor Scene Dataset and Benchmark for Mobile Robots

Most existing mobile robotic datasets primarily capture static scenes, limiting their utility for evaluating robotic performance in dynamic environments. To address this, we present a mobile robot oriented large-scale indoor dataset,…

Robotics · Computer Science 2024-12-12 Zeshun Li , Fuhao Li , Wanting Zhang , Zijie Zheng , Xueping Liu , Yongjin Liu , Long Zeng

Object Recognition Datasets and Challenges: A Review

Object recognition is among the fundamental tasks in the computer vision applications, paving the path for all other image understanding operations. In every stage of progress in object recognition research, efforts have been made to…

Computer Vision and Pattern Recognition · Computer Science 2025-07-31 Aria Salari , Abtin Djavadifar , Xiangrui Liu , Homayoun Najjaran

Mobile Robot Oriented Large-Scale Indoor Dataset for Dynamic Scene Understanding

Most existing robotic datasets capture static scene data and thus are limited in evaluating robots' dynamic performance. To address this, we present a mobile robot oriented large-scale indoor dataset, denoted as THUD (Tsinghua University…

Robotics · Computer Science 2024-07-02 Yifan Tang , Cong Tai , Fangxing Chen , Wanting Zhang , Tao Zhang , Xueping Liu , Yongjin Liu , Long Zeng

Real-Time Indoor Object Detection based on hybrid CNN-Transformer Approach

Real-time object detection in indoor settings is a challenging area of computer vision, faced with unique obstacles such as variable lighting and complex backgrounds. This field holds significant potential to revolutionize applications like…

Computer Vision and Pattern Recognition · Computer Science 2024-09-04 Salah Eddine Laidoudi , Madjid Maidi , Samir Otmane

Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine Synergy

Large-scale datasets play a vital role in computer vision. But current datasets are annotated blindly without differentiation to samples, making the data collection inefficient and unscalable. The open question is how to build a mega-scale…

Computer Vision and Pattern Recognition · Computer Science 2022-08-26 Yuanhan Zhang , Qinghong Sun , Yichun Zhou , Zexin He , Zhenfei Yin , Kun Wang , Lu Sheng , Yu Qiao , Jing Shao , Ziwei Liu

NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis

Recent approaches in depth-based human activity analysis achieved outstanding performance and proved the effectiveness of 3D representation for classification of action classes. Currently available depth-based and RGB+D-based action…

Computer Vision and Pattern Recognition · Computer Science 2016-04-12 Amir Shahroudy , Jun Liu , Tian-Tsong Ng , Gang Wang