We introduce the matrix-based Renyi's α-order entropy functional to parameterize Tishby et al. information bottleneck (IB) principle with a neural network. We term our methodology Deep Deterministic Information Bottleneck (DIB), as it avoids variational inference and distribution assumption. We show that deep neural networks trained with DIB outperform the variational objective counterpart and those that are trained with other forms of regularization, in terms of generalization performance and robustness to adversarial attack.Code available at https://github.com/yuxi120407/DIB
@article{arxiv.2102.00533,
title = {Deep Deterministic Information Bottleneck with Matrix-based Entropy Functional},
author = {Xi Yu and Shujian Yu and Jose C. Principe},
journal= {arXiv preprint arXiv:2102.00533},
year = {2021}
}
Comments
Accepted at ICASSP-21. Code available at https://github.com/yuxi120407/DIB. Extended version of the suppelementary material in "Measuring the Dependence with Matrix-based Entropy Functional", AAAI-21, arXiv:2101.10160