
Kernel Self-Attention in Deep Multiple Instance Learning

Machine Learning 2021-03-08 v2 Computer Vision and Pattern Recognition Machine Learning

Abstract

Not all supervised learning problems are described by a pair consisting of a fixed-size input tensor and a label. In some cases, especially in medical image analysis, a label corresponds to a bag of instances (e.g. image patches), and classifying such a bag requires aggregating information from all of its instances. Several models have been proposed for working with bags of instances; however, they assume that there are no dependencies within the bag and that the label is connected to at least one instance. In this work, we introduce the Self-Attention Attention-based MIL Pooling (SA-AbMILP) aggregation operation to account for the dependencies between instances. We conduct several experiments on MNIST, histological, microbiological, and retinal databases to show that SA-AbMILP performs better than other models. Additionally, we investigate kernel variations of Self-Attention and their influence on the results.
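As a rough illustration of the aggregation idea described in the abstract, the sketch below applies a self-attention step over the instance embeddings of a bag and then pools them with attention-based MIL weights. It is a minimal NumPy sketch under stated assumptions, not the authors' implementation: the function names, the parameter shapes, the residual scaling factor `gamma`, and the plain dot-product similarity are all chosen here for illustration, and instance embeddings are assumed to have been extracted already.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def self_attention(H, Wq, Wk, Wv, gamma=1.0):
    # H: (n_instances, d) instance embeddings of one bag.
    # Assumption: dot-product similarity; a kernel variant would
    # replace the `Q @ K.T` term with another similarity function.
    Q, K, V = H @ Wq, H @ Wk, H @ Wv
    A = softmax(Q @ K.T, axis=1)        # (n, n) instance-to-instance weights
    return H + gamma * (A @ V)          # residual connection (assumed)


def attention_mil_pooling(H, V, w):
    # Attention-based MIL pooling: a_k = softmax_k(w^T tanh(V^T h_k)).
    logits = np.tanh(H @ V) @ w         # (n,)
    a = softmax(logits, axis=0)         # weights over instances, sum to 1
    return a @ H, a                     # bag embedding (d,), weights (n,)


def sa_abmilp(H, p):
    # Self-attention first, then attention pooling over the updated instances.
    H_sa = self_attention(H, p["Wq"], p["Wk"], p["Wv"], p["gamma"])
    return attention_mil_pooling(H_sa, p["V"], p["w"])


# Hypothetical sizes and random parameters, for demonstration only.
rng = np.random.default_rng(0)
n, d, dk, l = 5, 8, 4, 6
params = {
    "Wq": rng.normal(size=(d, dk)),
    "Wk": rng.normal(size=(d, dk)),
    "Wv": rng.normal(size=(d, d)),
    "V": rng.normal(size=(d, l)),
    "w": rng.normal(size=l),
    "gamma": 0.5,
}
bag = rng.normal(size=(n, d))
z, a = sa_abmilp(bag, params)
```

Because the self-attention step is permutation-equivariant and the pooling step is a weighted sum, the resulting bag embedding `z` does not depend on the order of instances in the bag, which is the property a bag-level aggregation needs.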

Cite

@article{arxiv.2005.12991,
  title  = {Kernel Self-Attention in Deep Multiple Instance Learning},
  author = {Dawid Rymarczyk and Adriana Borowa and Jacek Tabor and Bartosz Zieliński},
  journal= {arXiv preprint arXiv:2005.12991},
  year   = {2021}
}

Comments

https://openaccess.thecvf.com/content/WACV2021/papers/Rymarczyk_Kernel_Self-Attention_for_Weakly-Supervised_Image_Classification_Using_Deep_Multiple_Instance_WACV_2021_paper.pdf
