Related papers: Computable functors and effective interpretability

On functors enumerating structures

We study a new notion of reduction between structures called enumerable functors related to the recently investigated notion of computable functors. Our main result shows that enumerable functors and effective interpretability with the…

Logic · Mathematics 2017-08-11 Dino Rossegger

Positive enumerable functors

We study reductions well suited to compare structures and classes of structures with respect to properties based on enumeration reducibility. We introduce the notion of a positive enumerable functor and study the relationship with…

Logic · Mathematics 2021-02-10 Barbara Csima , Dino Rossegger , Zhi Ying "Daniel" Yu

Assessing the Local Interpretability of Machine Learning Models

The increasing adoption of machine learning tools has led to calls for accountability via model interpretability. But what does it mean for a machine learning model to be interpretable by humans, and how can this be assessed? We focus on…

Machine Learning · Computer Science 2019-08-06 Dylan Slack , Sorelle A. Friedler , Carlos Scheidegger , Chitradeep Dutta Roy

On quantitative aspects of model interpretability

Despite the growing body of work in interpretable machine learning, it remains unclear how to evaluate different explainability methods without resorting to qualitative assessment and user-studies. While interpretability is an inherently…

Machine Learning · Computer Science 2020-07-16 An-phi Nguyen , María Rodríguez Martínez

Generalized Effective Reducibility

We introduce two notions of effective reducibility for set-theoretical statements, based on computability with Ordinal Turing Machines (OTMs), one of which resembles Turing reducibility while the other is modelled after Weihrauch…

Logic · Mathematics 2026-05-19 Merlin Carl

The Definitions of Interpretability and Learning of Interpretable Models

As machine learning algorithms getting adopted in an ever-increasing number of applications, interpretation has emerged as a crucial desideratum. In this paper, we propose a mathematical definition for the human-interpretable model. In…

Machine Learning · Computer Science 2021-06-01 Weishen Pan , Changshui Zhang

An effective version of definability in metric model theory

In this paper, a computably definable predicate is defined and characterized. Then, it is proved that every separable infinite-dimensional Hilbert structure in an effectively presented language is computable. Moreover, every definable…

Logic in Computer Science · Computer Science 2020-11-12 Nazanin Roshandel Tavana

From Mechanistic to Compositional Interpretability

Mechanistic interpretability aims to explain neural model behaviour by reverse-engineering learned computational structure into human-understandable components. Without a formal framework, however, mechanistic explanations cannot be…

Machine Learning · Computer Science 2026-05-12 Ward Gauderis , Thomas Dooms , Steven T. Holmer , Kola Ayonrinde , Geraint A. Wiggins

Foundations of Interpretable Models

We argue that existing definitions of interpretability are not actionable in that they fail to inform users about general, sound, and robust interpretable model design. This makes current interpretability research fundamentally ill-posed.…

Machine Learning · Computer Science 2025-08-04 Pietro Barbiero , Mateo Espinosa Zarlenga , Alberto Termine , Mateja Jamnik , Giuseppe Marra

Three notions of effective computation on $\mathbb{R}$

We compare three notions of effectiveness on uncountable structures. The first notion is that of a $\real$-computable structure, based on a model of computation proposed by Blum, Shub, and Smale, which uses full-precision real arithmetic.…

Logic · Mathematics 2008-09-01 Wesley Calvert

Learning Interpretable Concepts: Unifying Causal Representation Learning and Foundation Models

To build intelligent machine learning systems, there are two broad approaches. One approach is to build inherently interpretable models, as endeavored by the growing field of causal representation learning. The other approach is to build…

Machine Learning · Computer Science 2024-12-10 Goutham Rajendran , Simon Buchholz , Bryon Aragam , Bernhard Schölkopf , Pradeep Ravikumar

A Formal Framework to Characterize Interpretability of Procedures

We provide a novel notion of what it means to be interpretable, looking past the usual association with human understanding. Our key insight is that interpretability is not an absolute concept and so we define it relative to a target model,…

Artificial Intelligence · Computer Science 2017-07-14 Amit Dhurandhar , Vijay Iyengar , Ronny Luss , Karthikeyan Shanmugam

On the Relationship Between Interpretability and Explainability in Machine Learning

Interpretability and explainability have gained more and more attention in the field of machine learning as they are crucial when it comes to high-stakes decisions and troubleshooting. Since both provide information about predictors and…

Machine Learning · Computer Science 2024-04-26 Benjamin Leblanc , Pascal Germain

Interpretable Syntactic Representations Enable Hierarchical Word Vectors

The distributed representations currently used are dense and uninterpretable, leading to interpretations that themselves are relative, overcomplete, and hard to interpret. We propose a method that transforms these word vectors into reduced…

Computation and Language · Computer Science 2024-11-14 Biraj Silwal

Interpretability Can Be Actionable

Interpretability aims to explain the behavior of deep neural networks. Despite rapid growth, there is mounting concern that much of this work has not translated into practical impact, raising questions about its relevance and utility. This…

Machine Learning · Computer Science 2026-05-13 Hadas Orgad , Fazl Barez , Tal Haklay , Isabelle Lee , Marius Mosbach , Anja Reusch , Naomi Saphra , Byron Wallace , Sarah Wiegreffe , Eric Wong , Ian Tenney , Mor Geva

A Structural Approach to Reversible Computation

Reversibility is a key issue in the interface between computation and physics, and of growing importance as miniaturization progresses towards its physical limits. Most foundational work on reversible computing to date has focussed on…

Logic in Computer Science · Computer Science 2011-12-01 Samson Abramsky

A Computable Functor From Graphs to Fields

We construct a fully faithful functor from the category of graphs to the category of fields. Using this functor, we resolve a longstanding open problem in computable model theory, by showing that for every nontrivial countable structure S,…

Logic · Mathematics 2015-10-27 Russell Miller , Bjorn Poonen , Hans Schoutens , Alexandra Shlapentokh

Feature-Based Interpretable Surrogates for Optimization

For optimization models to be used in practice, it is crucial that users trust the results. A key factor in this aspect is the interpretability of the solution process. A previous framework for inherently interpretable optimization models…

Optimization and Control · Mathematics 2026-02-13 Marc Goerigk , Michael Hartisch , Sebastian Merten , Kartikey Sharma

A Disentangling Invertible Interpretation Network for Explaining Latent Representations

Neural networks have greatly boosted performance in computer vision by learning powerful representations of input data. The drawback of end-to-end training for maximal overall performance are black-box models whose hidden representations…

Computer Vision and Pattern Recognition · Computer Science 2020-04-29 Patrick Esser , Robin Rombach , Björn Ommer

Internalising modified realisability in constructive type theory

A modified realisability interpretation of infinitary logic is formalised and proved sound in constructive type theory (CTT). The logic considered subsumes first order logic. The interpretation makes it possible to extract programs with…

Logic · Mathematics 2017-01-11 Erik Palmgren