English

Set Aggregation Network as a Trainable Pooling Layer

Machine Learning 2020-01-23 v3 Artificial Intelligence Computer Vision and Pattern Recognition Machine Learning

Abstract

Global pooling, such as max- or sum-pooling, is one of the key ingredients in deep neural networks used for processing images, texts, graphs and other types of structured data. Based on the recent DeepSets architecture proposed by Zaheer et al. (NIPS 2017), we introduce a Set Aggregation Network (SAN) as an alternative global pooling layer. In contrast to typical pooling operators, SAN allows to embed a given set of features to a vector representation of arbitrary size. We show that by adjusting the size of embedding, SAN is capable of preserving the whole information from the input. In experiments, we demonstrate that replacing global pooling layer by SAN leads to the improvement of classification accuracy. Moreover, it is less prone to overfitting and can be used as a regularizer.

Keywords

Cite

@article{arxiv.1810.01868,
  title  = {Set Aggregation Network as a Trainable Pooling Layer},
  author = {Łukasz Maziarka and Marek Śmieja and Aleksandra Nowak and Jacek Tabor and Łukasz Struski and Przemysław Spurek},
  journal= {arXiv preprint arXiv:1810.01868},
  year   = {2020}
}

Comments

ICONIP 2019