English

Explainable AI needs formalization

Machine Learning 2026-04-09 v6 Artificial Intelligence Machine Learning

Abstract

The field of "explainable artificial intelligence" (XAI) seemingly addresses the desire that decisions of machine learning systems should be human-understandable. However, in its current state, XAI itself needs scrutiny. Popular methods cannot reliably answer relevant questions about ML models, their training data, or test inputs, because they systematically attribute importance to input features that are independent of the prediction target. This limits the utility of XAI for diagnosing and correcting data and models, for scientific discovery, and for identifying intervention targets. The fundamental reason for this is that current XAI methods do not address well-defined problems and are not evaluated against targeted criteria of explanation correctness. Researchers should formally define the problems they intend to solve and design methods accordingly. This will lead to diverse use-case-dependent notions of explanation correctness and objective metrics of explanation performance that can be used to validate XAI algorithms.

Keywords

Cite

@article{arxiv.2409.14590,
  title  = {Explainable AI needs formalization},
  author = {Stefan Haufe and Rick Wilming and Benedict Clark and Rustam Zhumagambetov and Ahcène Boubekki and Jörg Martin and Danny Panknin},
  journal= {arXiv preprint arXiv:2409.14590},
  year   = {2026}
}