Related papers: WebVision Database: Visual Learning and Understand…

WebVision Challenge: Visual Learning and Understanding With Web Data

We present the 2017 WebVision Challenge, a public image recognition challenge designed for deep learning based on web images without instance-level human annotation. Following the spirit of previous vision challenges, such as ILSVRC,…

Computer Vision and Pattern Recognition · Computer Science 2017-05-17 Wen Li , Limin Wang , Wei Li , Eirikur Agustsson , Jesse Berent , Abhinav Gupta , Rahul Sukthankar , Luc Van Gool

On Deep Representation Learning from Noisy Web Images

The keep-growing content of Web images may be the next important data source to scale up deep neural networks, which recently obtained a great success in the ImageNet classification challenge and related tasks. This prospect, however, has…

Computer Vision and Pattern Recognition · Computer Science 2016-07-19 Phong D. Vo , Alexandru Ginsca , Hervé Le Borgne , Adrian Popescu

WebUI: A Dataset for Enhancing Visual UI Understanding with Web Semantics

Modeling user interfaces (UIs) from visual information allows systems to make inferences about the functionality and semantics needed to support use cases in accessibility, app automation, and testing. Current datasets for training machine…

Human-Computer Interaction · Computer Science 2023-02-01 Jason Wu , Siyan Wang , Siman Shen , Yi-Hao Peng , Jeffrey Nichols , Jeffrey P. Bigham

Learning from Large-scale Noisy Web Data with Ubiquitous Reweighting for Image Classification

Many advances of deep learning techniques originate from the efforts of addressing the image classification task on large-scale datasets. However, the construction of such clean datasets is costly and time-consuming since the Internet is…

Computer Vision and Pattern Recognition · Computer Science 2019-04-04 Jia Li , Yafei Song , Jianfeng Zhu , Lele Cheng , Ying Su , Lin Ye , Pengcheng Yuan , Shumin Han

Extracting Visual Knowledge from the Internet: Making Sense of Image Data

Recent successes in visual recognition can be primarily attributed to feature representation, learning algorithms, and the ever-increasing size of labeled training data. Extensive research has been devoted to the first two, but much less…

Computer Vision and Pattern Recognition · Computer Science 2019-06-10 Yazhou Yao , Jian Zhang , Xiansheng Hua , Fumin Shen , Zhenmin Tang

Learning without Prejudice: Avoiding Bias in Webly-Supervised Action Recognition

Webly-supervised learning has recently emerged as an alternative paradigm to traditional supervised learning based on large-scale datasets with manual annotations. The key idea is that models such as CNNs can be learned from the noisy…

Computer Vision and Pattern Recognition · Computer Science 2017-09-08 Christian Rupprecht , Ansh Kapil , Nan Liu , Lamberto Ballan , Federico Tombari

Learning Deep Visual Object Models From Noisy Web Data: How to Make it Work

Deep networks thrive when trained on large scale data collections. This has given ImageNet a central role in the development of deep architectures for visual object classification. However, ImageNet was created during a specific period in…

Computer Vision and Pattern Recognition · Computer Science 2018-05-23 Nizar Massouh , Francesca Babiloni , Tatiana Tommasi , Jay Young , Nick Hawes , Barbara Caputo

CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images

We present a simple yet efficient approach capable of training deep neural networks on large-scale weakly-supervised web images, which are crawled raw from the Internet by using text queries, without any human annotation. We develop a…

Computer Vision and Pattern Recognition · Computer Science 2018-10-19 Sheng Guo , Weilin Huang , Haozhi Zhang , Chenfan Zhuang , Dengke Dong , Matthew R. Scott , Dinglong Huang

VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search

Vision-Language Models have made significant progress on many perception-focused tasks. However, their progress on reasoning-focused tasks remains limited due to the lack of high-quality and diverse training data. In this work, we aim to…

Computer Vision and Pattern Recognition · Computer Science 2025-03-18 Yiming Jia , Jiachen Li , Xiang Yue , Bo Li , Ping Nie , Kai Zou , Wenhu Chen

Data-driven Meta-set Based Fine-Grained Visual Classification

Constructing fine-grained image datasets typically requires domain-specific expert knowledge, which is not always available for crowd-sourcing platform annotators. Accordingly, learning directly from web images becomes an alternative method…

Computer Vision and Pattern Recognition · Computer Science 2020-08-07 Chuanyi Zhang , Yazhou Yao , Xiangbo Shu , Zechao Li , Zhenmin Tang , Qi Wu

Learning to Learn from Noisy Web Videos

Understanding the simultaneously very diverse and intricately fine-grained set of possible human actions is a critical open problem in computer vision. Manually labeling training videos is feasible for some action classes but doesn't scale…

Computer Vision and Pattern Recognition · Computer Science 2017-06-12 Serena Yeung , Vignesh Ramanathan , Olga Russakovsky , Liyue Shen , Greg Mori , Li Fei-Fei

NoisyActions2M: A Multimedia Dataset for Video Understanding from Noisy Labels

Deep learning has shown remarkable progress in a wide range of problems. However, efficient training of such models requires large-scale datasets, and getting annotations for such datasets can be challenging and costly. In this work, we…

Multimedia · Computer Science 2021-10-14 Mohit Sharma , Raj Patra , Harshal Desai , Shruti Vyas , Yogesh Rawat , Rajiv Ratn Shah

Training Convolutional Networks with Web Images

In this thesis we investigate the effect of using web images to build a large scale database to be used along a deep learning method for a classification task. We replicate the ImageNet large scale database (ILSVRC-2012) from images…

Computer Vision and Pattern Recognition · Computer Science 2018-05-23 Nizar Massouh

Webly Supervised Learning of Convolutional Networks

We present an approach to utilize large amounts of web data for learning CNNs. Specifically inspired by curriculum learning, we present a two-step approach for CNN training. First, we use easy images to train an initial visual…

Computer Vision and Pattern Recognition · Computer Science 2015-10-09 Xinlei Chen , Abhinav Gupta

Image to Video Domain Adaptation Using Web Supervision

Training deep neural networks typically requires large amounts of labeled data which may be scarce or expensive to obtain for a particular target domain. As an alternative, we can leverage webly-supervised data (i.e. results from a public…

Computer Vision and Pattern Recognition · Computer Science 2019-08-06 Andrew Kae , Yale Song

Harvesting Visual Objects from Internet Images via Deep Learning Based Objectness Assessment

The collection of internet images has been growing in an astonishing speed. It is undoubted that these images contain rich visual information that can be useful in many applications, such as visual media creation and data-driven image…

Computer Vision and Pattern Recognition · Computer Science 2019-04-02 Kan Wu , Guanbin Li , Haofeng Li , Jianjun Zhang , Yizhou Yu

Learning from Noisy Web Data with Category-level Supervision

As tons of photos are being uploaded to public websites (e.g., Flickr, Bing, and Google) every day, learning from web data has become an increasingly popular research direction because of freely available web resources, which is also…

Computer Vision and Pattern Recognition · Computer Science 2018-05-25 Li Niu , Qingtao Tang , Ashok Veeraraghavan , Ashu Sabharwal

Learning Visual N-Grams from Web Data

Real-world image recognition systems need to recognize tens of thousands of classes that constitute a plethora of visual concepts. The traditional approach of annotating thousands of images per class for training is infeasible in such a…

Computer Vision and Pattern Recognition · Computer Science 2017-08-08 Ang Li , Allan Jabri , Armand Joulin , Laurens van der Maaten

Learning Visual Features from Large Weakly Supervised Data

Convolutional networks trained on large supervised dataset produce visual features which form the basis for the state-of-the-art in many computer-vision problems. Further improvements of these visual features will likely require even larger…

Computer Vision and Pattern Recognition · Computer Science 2015-11-10 Armand Joulin , Laurens van der Maaten , Allan Jabri , Nicolas Vasilache

Attend in groups: a weakly-supervised deep learning framework for learning from web data

Large-scale datasets have driven the rapid development of deep neural networks for visual recognition. However, annotating a massive dataset is expensive and time-consuming. Web images and their labels are, in comparison, much easier to…

Computer Vision and Pattern Recognition · Computer Science 2016-12-01 Bohan Zhuang , Lingqiao Liu , Yao Li , Chunhua Shen , Ian Reid