English

Expression Syntax Information Bottleneck for Math Word Problems

Computation and Language 2026-01-12 v2

Abstract

Math Word Problems (MWP) aims to automatically solve mathematical questions given in texts. Previous studies tend to design complex models to capture additional information in the original text so as to enable the model to gain more comprehensive features. In this paper, we turn our attention in the opposite direction, and work on how to discard redundant features containing spurious correlations for MWP. To this end, we design an Expression Syntax Information Bottleneck method for MWP (called ESIB) based on variational information bottleneck, which extracts essential features of expression syntax tree while filtering latent-specific redundancy containing syntax-irrelevant features. The key idea of ESIB is to encourage multiple models to predict the same expression syntax tree for different problem representations of the same problem by mutual learning so as to capture consistent information of expression syntax tree and discard latent-specific redundancy. To improve the generalization ability of the model and generate more diverse expressions, we design a self-distillation loss to encourage the model to rely more on the expression syntax information in the latent space. Experimental results on two large-scale benchmarks show that our model not only achieves state-of-the-art results but also generates more diverse solutions. The code is available in https://github.com/menik1126/math_ESIB.

Keywords

Cite

@article{arxiv.2310.15664,
  title  = {Expression Syntax Information Bottleneck for Math Word Problems},
  author = {Jing Xiong and Chengming Li and Min Yang and Xiping Hu and Bin Hu},
  journal= {arXiv preprint arXiv:2310.15664},
  year   = {2026}
}

Comments

This paper has been accepted by SIGIR 2022. The code can be found at https://github.com/menik1126/math_ESIB

R2 v1 2026-06-28T13:00:00.815Z