English
Related papers

Related papers: Activity Driven Weakly Supervised Object Detection

200 papers

We introduce the task of weakly supervised learning for detecting human and object interactions in videos. Our task poses unique challenges as a system does not know what types of human-object interactions are present in a video or the…

Computer Vision and Pattern Recognition · Computer Science 2021-10-08 Shuang Li , Yilun Du , Antonio Torralba , Josef Sivic , Bryan Russell

It is laborious to manually label point cloud data for training high-quality 3D object detectors. This work proposes a weakly supervised approach for 3D object detection, only requiring a small set of weakly annotated scenes, associated…

Computer Vision and Pattern Recognition · Computer Science 2020-07-24 Qinghao Meng , Wenguan Wang , Tianfei Zhou , Jianbing Shen , Luc Van Gool , Dengxin Dai

This paper presents a detection-aware pre-training (DAP) approach, which leverages only weakly-labeled classification-style datasets (e.g., ImageNet) for pre-training, but is specifically tailored to benefit object detection tasks. In…

Computer Vision and Pattern Recognition · Computer Science 2021-04-01 Yuanyi Zhong , Jianfeng Wang , Lijuan Wang , Jian Peng , Yu-Xiong Wang , Lei Zhang

We present a novel approach to weakly supervised object detection. Instead of annotated images, our method only requires two short videos to learn to detect a new object: 1) a video of a moving object and 2) one or more "negative" videos of…

Computer Vision and Pattern Recognition · Computer Science 2019-10-01 Rico Jonschkowski , Austin Stone

Weakly-supervised object detection attempts to limit the amount of supervision by dispensing the need for bounding boxes, but still assumes image-level labels on the entire training set. In this work, we study the problem of training an…

Computer Vision and Pattern Recognition · Computer Science 2021-07-22 Zhaohui Yang , Miaojing Shi , Chao Xu , Vittorio Ferrari , Yannis Avrithis

Object localization is an important computer vision problem with a variety of applications. The lack of large scale object-level annotations and the relative abundance of image-level labels makes a compelling case for weak supervision in…

Computer Vision and Pattern Recognition · Computer Science 2016-03-30 Archith J. Bency , Heesung Kwon , Hyungtae Lee , S. Karthikeyan , B. S. Manjunath

We study weakly-supervised video object grounding: given a video segment and a corresponding descriptive sentence, the goal is to localize objects that are mentioned from the sentence in the video. During training, no object bounding boxes…

Computer Vision and Pattern Recognition · Computer Science 2018-07-23 Luowei Zhou , Nathan Louis , Jason J. Corso

A critical object detection task is finetuning an existing model to detect novel objects, but the standard workflow requires bounding box annotations which are time-consuming and expensive to collect. Weakly supervised object detection…

Computer Vision and Pattern Recognition · Computer Science 2023-05-29 Tyler LaBonte , Yale Song , Xin Wang , Vibhav Vineet , Neel Joshi

Weakly supervised object detection aims at learning precise object detectors, given image category labels. In recent prevailing works, this problem is generally formulated as a multiple instance learning module guided by an image…

Computer Vision and Pattern Recognition · Computer Science 2019-04-02 Xiaoyan Li , Meina Kan , Shiguang Shan , Xilin Chen

Weakly supervised object detection (WSOD) aims to classify and locate objects with only image-level supervision. Many WSOD approaches adopt multiple instance learning as the initial model, which is prone to converge to the most…

Computer Vision and Pattern Recognition · Computer Science 2020-11-23 Wenlong Gao , Ying Chen , Yong Peng

Weakly supervised temporal action detection is a Herculean task in understanding untrimmed videos, since no supervisory signal except the video-level category label is available on training data. Under the supervision of category labels,…

Computer Vision and Pattern Recognition · Computer Science 2018-07-19 Jia-Xing Zhong , Nannan Li , Weijie Kong , Tao Zhang , Thomas H. Li , Ge Li

Temporal Activity Detection aims to predict activity classes per frame, in contrast to video-level predictions in Activity Classification (i.e., Activity Recognition). Due to the expensive frame-level annotations required for detection, the…

Computer Vision and Pattern Recognition · Computer Science 2023-02-07 Kumara Kahatapitiya , Zhou Ren , Haoxiang Li , Zhenyu Wu , Michael S. Ryoo , Gang Hua

Significant performance improvement has been achieved for fully-supervised video salient object detection with the pixel-wise labeled training datasets, which are time-consuming and expensive to obtain. To relieve the burden of data…

Computer Vision and Pattern Recognition · Computer Science 2021-04-07 Wangbo Zhao , Jing Zhang , Long Li , Nick Barnes , Nian Liu , Junwei Han

We present a semi-supervised approach that localizes multiple unknown object instances in long videos. We start with a handful of labeled boxes and iteratively learn and label hundreds of thousands of object instances. We propose criteria…

Computer Vision and Pattern Recognition · Computer Science 2015-05-22 Ishan Misra , Abhinav Shrivastava , Martial Hebert

In recent years, the performance of object detection has advanced significantly with the evolving deep convolutional neural networks. However, the state-of-the-art object detection methods still rely on accurate bounding box annotations…

Computer Vision and Pattern Recognition · Computer Science 2017-07-31 Qingyi Tao , Hao Yang , Jianfei Cai

Labeling data is often expensive and time-consuming, especially for tasks such as object detection and instance segmentation, which require dense labeling of the image. While few-shot object detection is about training a model on novel…

Computer Vision and Pattern Recognition · Computer Science 2022-08-24 Gabriel Huang , Issam Laradji , David Vazquez , Simon Lacoste-Julien , Pau Rodriguez

Weakly-supervised action localization aims to recognize and localize action instancese in untrimmed videos with only video-level labels. Most existing models rely on multiple instance learning(MIL), where the predictions of unlabeled…

Computer Vision and Pattern Recognition · Computer Science 2023-09-27 Guiqin Wang , Peng Zhao , Cong Zhao , Shusen Yang , Jie Cheng , Luziwei Leng , Jianxing Liao , Qinghai Guo

Understanding human behavior is an important problem in the pursuit of visual intelligence. A challenge in this endeavor is the extensive and costly effort required to accurately label action segments. To address this issue, we consider…

Computer Vision and Pattern Recognition · Computer Science 2025-04-09 Seth Z. Zhao , Reza Ghoddoosian , Isht Dwivedi , Nakul Agarwal , Behzad Dariush

Object detection has achieved promising success, but requires large-scale fully-annotated data, which is time-consuming and labor-extensive. Therefore, we consider object detection with mixed supervision, which learns novel object…

Computer Vision and Pattern Recognition · Computer Science 2021-10-28 Yan Liu , Zhijie Zhang , Li Niu , Junjie Chen , Liqing Zhang

When humans describe images they tend to use combinations of nouns and adjectives, corresponding to objects and their associated attributes respectively. To generate such a description automatically, one needs to model objects, attributes…

Computer Vision and Pattern Recognition · Computer Science 2015-04-02 Zhiyuan Shi , Yongxin Yang , Timothy M. Hospedales , Tao Xiang
‹ Prev 1 2 3 10 Next ›