Related papers: A Multi-scale Multiple Instance Video Description …

Multiple Instance Learning Convolutional Neural Networks for Object Recognition

Convolutional Neural Networks (CNN) have demon- strated its successful applications in computer vision, speech recognition, and natural language processing. For object recog- nition, CNNs might be limited by its strict label requirement and…

Computer Vision and Pattern Recognition · Computer Science 2016-10-12 Miao Sun , Tony X. Han , Ming-Chang Liu , Ahmad Khodayari-Rostamabad

Efficient Large Scale Video Classification

Video classification has advanced tremendously over the recent years. A large part of the improvements in video classification had to do with the work done by the image classification community and the use of deep convolutional networks…

Computer Vision and Pattern Recognition · Computer Science 2015-05-26 Balakrishnan Varadarajan , George Toderici , Sudheendra Vijayanarasimhan , Apostol Natsev

Beyond Short Snippets: Deep Networks for Video Classification

Convolutional neural networks (CNNs) have been extensively applied for image recognition problems giving state-of-the-art results on recognition, detection, segmentation and retrieval. In this work we propose and evaluate several deep…

Computer Vision and Pattern Recognition · Computer Science 2015-04-14 Joe Yue-Hei Ng , Matthew Hausknecht , Sudheendra Vijayanarasimhan , Oriol Vinyals , Rajat Monga , George Toderici

Video Semantic Object Segmentation by Self-Adaptation of DCNN

This paper proposes a new framework for semantic segmentation of objects in videos. We address the label inconsistency problem of deep convolutional neural networks (DCNNs) by exploiting the fact that videos have multiple frames; in a few…

Computer Vision and Pattern Recognition · Computer Science 2017-11-23 Seong-Jin Park , Ki-Sang Hong

Multi-scale recognition with DAG-CNNs

We explore multi-scale convolutional neural nets (CNNs) for image classification. Contemporary approaches extract features from a single output layer. By extracting features from multiple layers, one can simultaneously reason about high,…

Computer Vision and Pattern Recognition · Computer Science 2015-05-21 Songfan Yang , Deva Ramanan

Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition

In this paper, we propose a novel Convolutional Neural Network (CNN) architecture for learning multi-scale feature representations with good tradeoffs between speed and accuracy. This is achieved by using a multi-branch network, which has…

Computer Vision and Pattern Recognition · Computer Science 2019-08-01 Chun-Fu Chen , Quanfu Fan , Neil Mallinar , Tom Sercu , Rogerio Feris

CNN-RNN: A Unified Framework for Multi-label Image Classification

While deep convolutional neural networks (CNNs) have shown a great success in single-label image classification, it is important to note that real world images generally contain multiple labels, which could correspond to different objects,…

Computer Vision and Pattern Recognition · Computer Science 2016-04-18 Jiang Wang , Yi Yang , Junhua Mao , Zhiheng Huang , Chang Huang , Wei Xu

Visual Saliency Based on Multiscale Deep Features

Visual saliency is a fundamental problem in both cognitive and computational sciences, including computer vision. In this CVPR 2015 paper, we discover that a high-quality visual saliency model can be trained with multiscale features…

Computer Vision and Pattern Recognition · Computer Science 2015-04-13 Guanbin Li , Yizhou Yu

Micro-Net: A unified model for segmentation of various objects in microscopy images

Object segmentation and structure localization are important steps in automated image analysis pipelines for microscopy images. We present a convolution neural network (CNN) based deep learning architecture for segmentation of objects in…

Computer Vision and Pattern Recognition · Computer Science 2019-01-24 Shan E Ahmed Raza , Linda Cheung , Muhammad Shaban , Simon Graham , David Epstein , Stella Pelengaris , Michael Khan , Nasir M. Rajpoot

Hybrid CNN and Dictionary-Based Models for Scene Recognition and Domain Adaptation

Convolutional neural network (CNN) has achieved state-of-the-art performance in many different visual tasks. Learned from a large-scale training dataset, CNN features are much more discriminative and accurate than the hand-crafted features.…

Computer Vision and Pattern Recognition · Computer Science 2016-02-01 Guo-Sen Xie , Xu-Yao Zhang , Shuicheng Yan , Cheng-Lin Liu

Describing Videos by Exploiting Temporal Structure

Recent progress in using recurrent neural networks (RNNs) for image description has motivated the exploration of their application for video description. However, while images are static, working with videos requires modeling their dynamic…

Machine Learning · Statistics 2015-10-02 Li Yao , Atousa Torabi , Kyunghyun Cho , Nicolas Ballas , Christopher Pal , Hugo Larochelle , Aaron Courville

An attention-driven hierarchical multi-scale representation for visual recognition

Convolutional Neural Networks (CNNs) have revolutionized the understanding of visual content. This is mainly due to their ability to break down an image into smaller pieces, extract multi-scale localized features and compose them to…

Computer Vision and Pattern Recognition · Computer Science 2021-10-26 Zachary Wharton , Ardhendu Behera , Asish Bera

Multi-Instance Multi-Scale CNN for Medical Image Classification

Deep learning for medical image classification faces three major challenges: 1) the number of annotated medical images for training are usually small; 2) regions of interest (ROIs) are relatively small with unclear boundaries in the whole…

Computer Vision and Pattern Recognition · Computer Science 2019-10-23 Shaohua Li , Yong Liu , Xiuchao Sui , Cheng Chen , Gabriel Tjio , Daniel Shu Wei Ting , Rick Siow Mong Goh

Large-Scale 3D Scene Classification With Multi-View Volumetric CNN

We introduce a method to classify imagery using a convo- lutional neural network (CNN) on multi-view image pro- jections. The power of our method comes from using pro- jections of multiple images at multiple depth planes near the…

Computer Vision and Pattern Recognition · Computer Science 2017-12-27 Dror Aiger , Brett Allen , Aleksey Golovinskiy

Learning Enriched Features for Real Image Restoration and Enhancement

With the goal of recovering high-quality image content from its degraded version, image restoration enjoys numerous applications, such as in surveillance, computational photography, medical imaging, and remote sensing. Recently,…

Computer Vision and Pattern Recognition · Computer Science 2020-07-09 Syed Waqas Zamir , Aditya Arora , Salman Khan , Munawar Hayat , Fahad Shahbaz Khan , Ming-Hsuan Yang , Ling Shao

CMS-RCNN: Contextual Multi-Scale Region-based CNN for Unconstrained Face Detection

Robust face detection in the wild is one of the ultimate components to support various facial related problems, i.e. unconstrained face recognition, facial periocular recognition, facial landmarking and pose estimation, facial expression…

Computer Vision and Pattern Recognition · Computer Science 2016-06-20 Chenchen Zhu , Yutong Zheng , Khoa Luu , Marios Savvides

Multi-scale Unified Network for Image Classification

Convolutional Neural Networks (CNNs) have advanced significantly in visual representation learning and recognition. However, they face notable challenges in performance and computational efficiency when dealing with real-world, multi-scale…

Computer Vision and Pattern Recognition · Computer Science 2024-03-28 Wenzhuo Liu , Fei Zhu , Cheng-Lin Liu

Classifying and Segmenting Microscopy Images Using Convolutional Multiple Instance Learning

Convolutional neural networks (CNN) have achieved state of the art performance on both classification and segmentation tasks. Applying CNNs to microscopy images is challenging due to the lack of datasets labeled at the single cell level. We…

Computer Vision and Pattern Recognition · Computer Science 2017-05-25 Oren Z. Kraus , Lei Jimmy Ba , Brendan Frey

Multimodal Convolutional Neural Networks for Matching Image and Sentence

In this paper, we propose multimodal convolutional neural networks (m-CNNs) for matching image and sentence. Our m-CNN provides an end-to-end framework with convolutional architectures to exploit image representation, word composition, and…

Computer Vision and Pattern Recognition · Computer Science 2015-09-01 Lin Ma , Zhengdong Lu , Lifeng Shang , Hang Li

Knowledge Guided Disambiguation for Large-Scale Scene Classification with Multi-Resolution CNNs

Convolutional Neural Networks (CNNs) have made remarkable progress on scene recognition, partially due to these recent large-scale scene datasets, such as the Places and Places2. Scene categories are often defined by multi-level…

Computer Vision and Pattern Recognition · Computer Science 2017-04-26 Limin Wang , Sheng Guo , Weilin Huang , Yuanjun Xiong , Yu Qiao