Related papers: A Performance-Explainability Framework to Benchmar…

Counterfactual Explanations for Machine Learning on Multivariate Time Series Data

Applying machine learning (ML) on multivariate time series data has growing popularity in many application domains, including in computer system management. For example, recent high performance computing (HPC) research proposes a variety of…

Machine Learning · Computer Science 2021-08-20 Emre Ates, Burak Aksar, Vitus J. Leung, Ayse K. Coskun

XTSC-Bench: Quantitative Benchmarking for Explainers on Time Series Classification

Despite the growing body of work on explainable machine learning in time series classification (TSC), it remains unclear how to evaluate different explainability methods. Resorting to qualitative assessment and user studies to evaluate…

Machine Learning · Computer Science 2023-10-24 Jacqueline Höllig, Steffen Thoma, Florian Grimm

timeXplain -- A Framework for Explaining the Predictions of Time Series Classifiers

Modern time series classifiers display impressive predictive capabilities, yet their decision-making processes mostly remain black boxes to the user. At the same time, model-agnostic explainers, such as the recently proposed SHAP, promise…

Machine Learning · Computer Science 2023-11-21 Felix Mujkanovic, Vanja Doskoč, Martin Schirneck, Patrick Schäfer, Tobias Friedrich

Explainable Multivariate Time Series Classification: A Deep Neural Network Which Learns To Attend To Important Variables As Well As Informative Time Intervals

Time series data is prevalent in a wide variety of real-world applications and it calls for trustworthy and explainable models for people to understand and fully trust decisions made by AI solutions. We consider the problem of building…

Machine Learning · Computer Science 2020-11-25 Tsung-Yu Hsieh, Suhang Wang, Yiwei Sun, Vasant Honavar

ExplainBench: A Benchmark Framework for Local Model Explanations in Fairness-Critical Applications

As machine learning systems are increasingly deployed in high-stakes domains such as criminal justice, finance, and healthcare, the demand for interpretable and trustworthy models has intensified. Despite the proliferation of local…

Machine Learning · Computer Science 2025-06-10 James Afful

Explainable Benchmarking for Iterative Optimization Heuristics

Benchmarking heuristic algorithms is vital to understand under which conditions and on what kind of problems certain algorithms perform well. In most current research into heuristic optimization algorithms, only a very limited number of…

Neural and Evolutionary Computing · Computer Science 2024-02-26 Niki van Stein, Diederick Vermetten, Anna V. Kononova, Thomas Bäck

A Theoretical Framework for Adaptive Utility-Weighted Benchmarking

Benchmarking has long served as a foundational practice in machine learning and, increasingly, in modern AI systems such as large language models, where shared tasks, metrics, and leaderboards offer a common basis for measuring progress and…

Artificial Intelligence · Computer Science 2026-02-16 Philip Waggoner

BenchML: an extensible pipelining framework for benchmarking representations of materials and molecules at scale

We introduce a machine-learning (ML) framework for high-throughput benchmarking of diverse representations of chemical systems against datasets of materials and molecules. The guiding principle underlying the benchmarking approach is to…

Machine Learning · Computer Science 2021-12-07 Carl Poelking, Felix A. Faber, Bingqing Cheng

Explainability Fact Sheets: A Framework for Systematic Assessment of Explainable Approaches

Explanations in Machine Learning come in many forms, but a consensus regarding their desired properties is yet to emerge. In this paper we introduce a taxonomy and a set of descriptors that can be used to characterise and systematically…

Machine Learning · Computer Science 2019-12-12 Kacper Sokol, Peter Flach

TSPP: A Unified Benchmarking Tool for Time-series Forecasting

While machine learning has witnessed significant advancements, the emphasis has largely been on data acquisition and model creation. However, achieving a comprehensive assessment of machine learning solutions in real-world settings…

Machine Learning · Computer Science 2024-01-09 Jan Bączek, Dmytro Zhylko, Gilberto Titericz, Sajad Darabi, Jean-Francois Puget, Izzy Putterman, Dawid Majchrowski, Anmol Gupta, Kyle Kranen, Pawel Morkisz

A Comparative Evaluation of Log-Based Process Performance Analysis Techniques

Process mining has gained traction over the past decade and an impressive body of research has resulted in the introduction of a variety of process mining approaches measuring process performance. Having this set of techniques available,…

Performance · Computer Science 2018-04-12 Fredrik Milani, Fabrizio M. Maggi

A Unified Study of Machine Learning Explanation Evaluation Metrics

The growing need for trustworthy machine learning has led to the blossom of interpretability research. Numerous explanation methods have been developed to serve this purpose. However, these methods are deficiently and inappropriately…

Machine Learning · Computer Science 2022-03-29 Yipei Wang, Xiaoqian Wang

How to show a probabilistic model is better

We present a simple theoretical framework, and corresponding practical procedures, for comparing probabilistic models on real data in a traditional machine learning setting. This framework is based on the theory of proper scoring rules, but…

Machine Learning · Statistics 2015-02-13 Mithun Chakraborty, Sanmay Das, Allen Lavoie

Explainable Benchmarking through the Lense of Concept Learning

Evaluating competing systems in a comparable way, i.e., benchmarking them, is an undeniable pillar of the scientific method. However, system performance is often summarized via a small number of metrics. The analysis of the evaluation…

Machine Learning · Computer Science 2025-10-24 Quannian Zhang, Michael Röder, Nikit Srivastava, N'Dah Jean Kouagou, Axel-Cyrille Ngonga Ngomo

A Multi-Objective Evaluation Framework for Analyzing Utility-Fairness Trade-Offs in Machine Learning Systems

The evaluation of fairness models in Machine Learning involves complex challenges, such as defining appropriate metrics, balancing trade-offs between utility and fairness, and there are still gaps in this stage. This work presents a novel…

Machine Learning · Computer Science 2026-03-03 Gökhan Özbulak, Oscar Jimenez-del-Toro, Maíra Fatoretto, Lilian Berton, André Anjos

Improving the Validity and Practical Usefulness of AI/ML Evaluations Using an Estimands Framework

Commonly, AI or machine learning (ML) models are evaluated on benchmark datasets. This practice supports innovative methodological research, but benchmark performance can be poorly correlated with performance in real-world applications -- a…

Machine Learning · Computer Science 2024-06-18 Olivier Binette, Jerome P. Reiter

Multi-Stage Prototype Learning for Interpretable Time Series Classification

Deep learning methods are powerful tools in classifying multivariate time series data. Despite their high performance, these methods are hard to interpret, which diminishes their applications in high-risk domains such as healthcare. In this…

Machine Learning · Computer Science 2026-05-11 Bhavesh Kalisetti, Vincent Wang, Gaurav R. Ghosal, Maryam Bijanzadeh, Reza Abbasi-Asl

Explainability-Driven Quality Assessment for Rule-Based Systems

This paper introduces an explanation framework designed to enhance the quality of rules in knowledge-based reasoning systems based on dataset-driven insights. The traditional method for rule induction from data typically requires…

Artificial Intelligence · Computer Science 2025-02-04 Oshani Seneviratne, Brendan Capuzzo, William Van Woensel

Reconnoitering the class distinguishing abilities of the features, to know them better

The relevance of machine learning (ML) in our daily lives is closely intertwined with its explainability. Explainability can allow end-users to have a transparent and humane reckoning of a ML scheme's capability and utility. It will also…

Machine Learning · Computer Science 2022-12-27 Payel Sadhukhan, Sarbani Palit, Kausik Sengupta

Improving Network Interpretability via Explanation Consistency Evaluation

While deep neural networks have achieved remarkable performance, they tend to lack transparency in prediction. The pursuit of greater interpretability in neural networks often results in a degradation of their original performance. Some…

Computer Vision and Pattern Recognition · Computer Science 2024-08-09 Hefeng Wu, Hao Jiang, Keze Wang, Ziyi Tang, Xianghuan He, Liang Lin