
Structured IB: Improving Information Bottleneck with Structured Feature Learning

Information Theory 2025-04-18 v2 Machine Learning math.IT

Abstract

The Information Bottleneck (IB) principle has emerged as a promising approach for enhancing the generalization, robustness, and interpretability of deep neural networks, with demonstrated efficacy in image segmentation, document clustering, and semantic communication. Among IB implementations, the IB Lagrangian method, which employs Lagrange multipliers, is widely adopted. Numerous methods exist for optimizing the IB Lagrangian based on variational bounds and neural estimators, but their performance depends heavily on the quality of their design, which is inherently error-prone. To address this limitation, we introduce Structured IB, a framework for investigating potential structured features. By incorporating auxiliary encoders to extract missing informative features, we generate more informative representations. Our experiments demonstrate superior prediction accuracy and better preservation of task-relevant information compared to the original IB Lagrangian method, even with reduced network size.
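
For orientation, the IB Lagrangian mentioned above is commonly written in the form below. This is the standard formulation from the IB literature, not taken from the paper itself, so the paper's notation and sign convention may differ. The symbols are assumptions for illustration: X is the input, Y the prediction target, Z the learned representation, and β ≥ 0 the Lagrange multiplier that trades compression of X against preservation of information about Y.

```latex
\min_{p(z \mid x)} \; \mathcal{L}_{\mathrm{IB}} = I(X; Z) - \beta \, I(Z; Y)
```

Neither mutual information term is tractable for deep networks in general, which is why the variational bounds and neural estimators referred to in the abstract are used in practice.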

Cite

@article{arxiv.2412.08222,
  title  = {Structured IB: Improving Information Bottleneck with Structured Feature Learning},
  author = {Hanzhe Yang and Youlong Wu and Dingzhu Wen and Yong Zhou and Yuanming Shi},
  journal= {arXiv preprint arXiv:2412.08222},
  year   = {2025}
}