English
Related papers

Related papers: Refining activation downsampling with SoftPool

200 papers

Pooling layers are essential building blocks of convolutional neural networks (CNNs), to reduce computational overhead and increase the receptive fields of proceeding convolutional operations. Their goal is to produce downsampled volumes…

Computer Vision and Pattern Recognition · Computer Science 2022-12-05 Alexandros Stergiou , Ronald Poppe

Feature pooling layers (e.g., max pooling) in convolutional neural networks (CNNs) serve the dual purpose of providing increasingly abstract representations as well as yielding computational savings in subsequent convolutional layers. We…

Machine Learning · Computer Science 2016-11-17 Shuangfei Zhai , Hui Wu , Abhishek Kumar , Yu Cheng , Yongxi Lu , Zhongfei Zhang , Rogerio Feris

We reduce training time in convolutional networks (CNNs) with a method that, for some of the mini-batches: a) scales down the resolution of input images via downsampling, and b) reduces the forward pass operations via pooling on the…

Machine Learning · Computer Science 2019-10-16 Zissis Poulos , Ali Nouri , Andreas Moshovos

Pooling is a critical operation in convolutional neural networks for increasing receptive fields and improving robustness to input variations. Most existing pooling operations downsample the feature maps, which is a lossy process. Moreover,…

Computer Vision and Pattern Recognition · Computer Science 2021-04-05 Jiaojiao Zhao , Cees G. M. Snoek

Nowadays, Deep Neural Networks are among the main tools used in various sciences. Convolutional Neural Network is a special type of DNN consisting of several convolution layers, each followed by an activation function and a pooling layer.…

Computer Vision and Pattern Recognition · Computer Science 2020-09-17 Hossein Gholamalinezhad , Hossein Khosravi

Standard Convolutional Neural Networks (CNNs) designed for computer vision tasks tend to have large intermediate activation maps. These require large working memory and are thus unsuitable for deployment on resource-constrained devices…

Computer Vision and Pattern Recognition · Computer Science 2020-10-26 Oindrila Saha , Aditya Kusupati , Harsha Vardhan Simhadri , Manik Varma , Prateek Jain

Most convolutional neural networks use some method for gradually downscaling the size of the hidden layers. This is commonly referred to as pooling, and is applied to reduce the number of parameters, improve invariance to certain…

Computer Vision and Pattern Recognition · Computer Science 2018-04-13 Faraz Saeedan , Nicolas Weber , Michael Goesele , Stefan Roth

In most convolution neural networks (CNNs), downsampling hidden layers is adopted for increasing computation efficiency and the receptive field size. Such operation is commonly so-called pooling. Maximation and averaging over sliding…

Computer Vision and Pattern Recognition · Computer Science 2020-10-12 Hao Zhang , Jianwei Ma

Many convolutional neural networks (CNNs) rely on progressive downsampling of their feature maps to increase the network's receptive field and decrease computational cost. However, this comes at the price of losing granularity in the…

Computer Vision and Pattern Recognition · Computer Science 2023-05-17 Robin Hesse , Simone Schaub-Meyer , Stefan Roth

Pooling is essentially an operation from the field of Mathematical Morphology, with max pooling as a limited special case. The more general setting of MorphPooling greatly extends the tool set for building neural networks. In addition to…

Computer Vision and Pattern Recognition · Computer Science 2022-11-28 Rick Groenendijk , Leo Dorst , Theo Gevers

Convolutional Neural Networks (CNNs) are artificial learning systems typically based on two operations: convolution, which implements feature extraction through filtering, and pooling, which implements dimensionality reduction. The impact…

Machine Learning · Computer Science 2022-02-18 Dimitrios E. Diamantis , Dimitris K. Iakovidis

Deep convolutional neural networks (CNN) have shown their promise as a universal representation for recognition. However, global CNN activations lack geometric invariance, which limits their robustness for classification and matching of…

Computer Vision and Pattern Recognition · Computer Science 2014-09-10 Yunchao Gong , Liwei Wang , Ruiqi Guo , Svetlana Lazebnik

A number of recent studies have shown that a Deep Convolutional Neural Network (DCNN) pretrained on a large dataset can be adopted as a universal image description which leads to astounding performance in many visual classification tasks.…

Computer Vision and Pattern Recognition · Computer Science 2014-12-01 Lingqiao Liu , Chunhua Shen , Anton van den Hengel

In the framework of convolutional neural networks that lie at the heart of deep learning, downsampling is often performed with a max-pooling operation that only retains the element with maximum activation, while completely discarding the…

Computer Vision and Pattern Recognition · Computer Science 2018-04-17 Ashwani Kumar

In modern computer vision tasks, convolutional neural networks (CNNs) are indispensable for image classification tasks due to their efficiency and effectiveness. Part of their superiority compared to other architectures, comes from the fact…

Machine Learning · Computer Science 2019-06-11 Vighnesh Birodkar , Hossein Mobahi , Dilip Krishnan , Samy Bengio

In convolutional neural networks (CNNs), downsampling operations are crucial to model performance. Although traditional downsampling methods (such as maximum pooling and cross-row convolution) perform well in feature aggregation, receptive…

Computer Vision and Pattern Recognition · Computer Science 2025-07-22 Wenbo Yue , Chang Li , Guoping Xu

Recent studies have shown that a Deep Convolutional Neural Network (DCNN) pretrained on a large image dataset can be used as a universal image descriptor, and that doing so leads to impressive performance for a variety of image…

Computer Vision and Pattern Recognition · Computer Science 2016-12-23 Lingqiao Liu , Chunhua Shen , Anton van den Hengel

Despite the recent success in many applications, the high computational requirements of vision transformers limit their use in resource-constrained settings. While many existing methods improve the quadratic complexity of attention, in most…

Computer Vision and Pattern Recognition · Computer Science 2023-02-28 Dmitrii Marin , Jen-Hao Rick Chang , Anurag Ranjan , Anish Prabhu , Mohammad Rastegari , Oncel Tuzel

Pooling is a simple but essential layer in modern deep CNN architectures for feature aggregation and extraction. Typical CNN design focuses on the conv layers and activation functions, while leaving the pooling layers with fewer options. We…

Computer Vision and Pattern Recognition · Computer Science 2021-10-22 Bor-Shiun Wang , Jun-Wei Hsieh , Ming-Ching Chang , Ping-Yang Chen , Lipeng Ke , Siwei Lyu

Recently, very deep convolutional neural networks (CNNs) have shown outstanding performance in object recognition and have also been the first choice for dense classification problems such as semantic segmentation. However, repeated…

Computer Vision and Pattern Recognition · Computer Science 2016-11-28 Guosheng Lin , Anton Milan , Chunhua Shen , Ian Reid
‹ Prev 1 2 3 10 Next ›