Related papers: Broadcasting Convolutional Network for Visual Rela…

An attention-driven hierarchical multi-scale representation for visual recognition

Convolutional Neural Networks (CNNs) have revolutionized the understanding of visual content. This is mainly due to their ability to break down an image into smaller pieces, extract multi-scale localized features and compose them to…

Computer Vision and Pattern Recognition · Computer Science 2021-10-26 Zachary Wharton , Ardhendu Behera , Asish Bera

Learning Multi-Domain Convolutional Neural Networks for Visual Tracking

We propose a novel visual tracking algorithm based on the representations from a discriminatively trained Convolutional Neural Network (CNN). Our algorithm pretrains a CNN using a large set of videos with tracking ground-truths to obtain a…

Computer Vision and Pattern Recognition · Computer Science 2016-01-07 Hyeonseob Nam , Bohyung Han

Striving for Simplicity: The All Convolutional Net

Most modern convolutional neural networks (CNNs) used for object recognition are built using the same principles: Alternating convolution and max-pooling layers followed by a small number of fully connected layers. We re-evaluate the state…

Machine Learning · Computer Science 2015-04-14 Jost Tobias Springenberg , Alexey Dosovitskiy , Thomas Brox , Martin Riedmiller

Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition

In this paper, we propose a novel Convolutional Neural Network (CNN) architecture for learning multi-scale feature representations with good tradeoffs between speed and accuracy. This is achieved by using a multi-branch network, which has…

Computer Vision and Pattern Recognition · Computer Science 2019-08-01 Chun-Fu Chen , Quanfu Fan , Neil Mallinar , Tom Sercu , Rogerio Feris

A novel feature-scrambling approach reveals the capacity of convolutional neural networks to learn spatial relations

Convolutional neural networks (CNNs) are one of the most successful computer vision systems to solve object recognition. Furthermore, CNNs have major applications in understanding the nature of visual representations in the human brain. Yet…

Computer Vision and Pattern Recognition · Computer Science 2022-12-13 Amr Farahat , Felix Effenberger , Martin Vinck

Discovering objects and their relations from entangled scene representations

Our world can be succinctly and compactly described as structured scenes of objects and relations. A typical room, for example, contains salient objects such as tables, chairs and books, and these objects typically relate to each other by…

Machine Learning · Computer Science 2017-02-17 David Raposo , Adam Santoro , David Barrett , Razvan Pascanu , Timothy Lillicrap , Peter Battaglia

CoupleNet: Coupling Global Structure with Local Parts for Object Detection

The region-based Convolutional Neural Network (CNN) detectors such as Faster R-CNN or R-FCN have already shown promising results for object detection by combining the region proposal subnetwork and the classification subnetwork together.…

Computer Vision and Pattern Recognition · Computer Science 2017-08-10 Yousong Zhu , Chaoyang Zhao , Jinqiao Wang , Xu Zhao , Yi Wu , Hanqing Lu

A simple neural network module for relational reasoning

Relational reasoning is a central component of generally intelligent behavior, but has proven difficult for neural networks to learn. In this paper we describe how to use Relation Networks (RNs) as a simple plug-and-play module to solve…

Computation and Language · Computer Science 2017-06-06 Adam Santoro , David Raposo , David G. T. Barrett , Mateusz Malinowski , Razvan Pascanu , Peter Battaglia , Timothy Lillicrap

Exploit Bounding Box Annotations for Multi-label Object Recognition

Convolutional neural networks (CNNs) have shown great performance as general feature representations for object recognition applications. However, for multi-label images that contain multiple objects from different categories, scales and…

Computer Vision and Pattern Recognition · Computer Science 2016-06-06 Hao Yang , Joey Tianyi Zhou , Yu Zhang , Bin-Bin Gao , Jianxin Wu , Jianfei Cai

BCNN: Binary Complex Neural Network

Binarized neural networks, or BNNs, show great promise in edge-side applications with resource limited hardware, but raise the concerns of reduced accuracy. Motivated by the complex neural networks, in this paper we introduce complex…

Neural and Evolutionary Computing · Computer Science 2021-04-21 Yanfei Li , Tong Geng , Ang Li , Huimin Yu

Memory-Based Neighbourhood Embedding for Visual Recognition

Learning discriminative image feature embeddings is of great importance to visual recognition. To achieve better feature embeddings, most current methods focus on designing different network structures or loss functions, and the estimated…

Computer Vision and Pattern Recognition · Computer Science 2019-08-15 Suichan Li , Dapeng Chen , Bin Liu , Nenghai Yu , Rui Zhao

On the Exploration of Convolutional Fusion Networks for Visual Recognition

Despite recent advances in multi-scale deep representations, their limitations are attributed to expensive parameters and weak fusion modules. Hence, we propose an efficient approach to fuse multi-scale deep representations, called…

Computer Vision and Pattern Recognition · Computer Science 2016-11-18 Yu Liu , Yanming Guo , Michael S. Lew

SCNN: A General Distribution based Statistical Convolutional Neural Network with Application to Video Object Detection

Various convolutional neural networks (CNNs) were developed recently that achieved accuracy comparable with that of human beings in computer vision tasks such as image recognition, object detection and tracking, etc. Most of these networks,…

Computer Vision and Pattern Recognition · Computer Science 2019-03-20 Tianchen Wang , Jinjun Xiong , Xiaowei Xu , Yiyu Shi

B-CNN: Branch Convolutional Neural Network for Hierarchical Classification

Convolutional Neural Network (CNN) image classifiers are traditionally designed to have sequential convolutional layers with a single output layer. This is based on the assumption that all target classes should be treated equally and…

Computer Vision and Pattern Recognition · Computer Science 2017-10-06 Xinqi Zhu , Michael Bain

MDCN: Multi-Scale, Deep Inception Convolutional Neural Networks for Efficient Object Detection

Object detection in challenging situations such as scale variation, occlusion, and truncation depends not only on feature details but also on contextual information. Most previous networks emphasize too much on detailed feature extraction…

Computer Vision and Pattern Recognition · Computer Science 2018-09-07 Wenchi Ma , Yuanwei Wu , Zongbo Wang , Guanghui Wang

SANet: Structure-Aware Network for Visual Tracking

Convolutional neural network (CNN) has drawn increasing interest in visual tracking owing to its powerfulness in feature extraction. Most existing CNN-based trackers treat tracking as a classification problem. However, these trackers are…

Computer Vision and Pattern Recognition · Computer Science 2017-05-02 Heng Fan , Haibin Ling

Omnidirectional CNN for Visual Place Recognition and Navigation

$ $Visual place recognition is challenging, especially when only a few place exemplars are given. To mitigate the challenge, we consider place recognition method using omnidirectional cameras and propose a novel Omnidirectional…

Computer Vision and Pattern Recognition · Computer Science 2018-03-13 Tsun-Hsuan Wang , Hung-Jui Huang , Juan-Ting Lin , Chan-Wei Hu , Kuo-Hao Zeng , Min Sun

GCCN: Global Context Convolutional Network

In this paper, we propose Global Context Convolutional Network (GCCN) for visual recognition. GCCN computes global features representing contextual information across image patches. These global contextual features are defined as local…

Computer Vision and Pattern Recognition · Computer Science 2021-10-25 Ali Hamdi , Flora Salim , Du Yong Kim

Multi-dimensional Graph Convolutional Networks

Convolutional neural networks (CNNs) leverage the great power in representation learning on regular grid data such as image and video. Recently, increasing attention has been paid on generalizing CNNs to graph or network data which is…

Social and Information Networks · Computer Science 2018-08-21 Yao Ma , Suhang Wang , Charu C. Aggarwal , Dawei Yin , Jiliang Tang

A Novel Multi-Attention Driven System For Multi-Label Remote Sensing Image Classification

This paper presents a novel multi-attention driven system that jointly exploits Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) in the context of multi-label remote sensing (RS) image classification. The proposed…

Computer Vision and Pattern Recognition · Computer Science 2019-11-26 Gencer Sumbul , Begüm Demir