Related papers: A Framework for Auditing Multilevel Models using E…

An Audit Framework for Technical Assessment of Binary Classifiers

Multilevel models using logistic regression (MLogRM) and random forest models (RFM) are increasingly deployed in industry for the purpose of binary classification. The European Commission's proposed Artificial Intelligence Act (AIA)…

Computers and Society · Computer Science 2022-11-18 Debarati Bhaumik , Diptish Dey

Unlocking the Black Box: A Five-Dimensional Framework for Evaluating Explainable AI in Credit Risk

The financial industry faces a significant challenge modeling and risk portfolios: balancing the predictability of advanced machine learning models, neural network models, and explainability required by regulatory entities (such as Office…

Machine Learning · Computer Science 2025-11-10 Rongbin Ye , Jiaqi Chen

Quantifying Transparency of Machine Learning Systems through Analysis of Contributions

Increased adoption and deployment of machine learning (ML) models into business, healthcare and other organisational processes, will result in a growing disconnect between the engineers and researchers who developed the models and the…

Machine Learning · Computer Science 2019-07-09 Iain Barclay , Alun Preece , Ian Taylor , Dinesh Verma

Auditing large language models: a three-layered approach

Large language models (LLMs) represent a major advance in artificial intelligence (AI) research. However, the widespread use of LLMs is also coupled with significant ethical and social challenges. Previous research has pointed towards…

Computation and Language · Computer Science 2023-06-28 Jakob Mökander , Jonas Schuett , Hannah Rose Kirk , Luciano Floridi

Enhancing Trust in LLMs: Algorithms for Comparing and Interpreting LLMs

This paper surveys evaluation techniques to enhance the trustworthiness and understanding of Large Language Models (LLMs). As reliance on LLMs grows, ensuring their reliability, fairness, and transparency is crucial. We explore algorithmic…

Computation and Language · Computer Science 2024-06-05 Nik Bear Brown

Reliable fairness auditing with semi-supervised inference

Machine learning (ML) models often exhibit bias that can exacerbate inequities in biomedical applications. Fairness auditing, the process of evaluating a model's performance across subpopulations, is critical for identifying and mitigating…

Methodology · Statistics 2026-05-19 Jianhui Gao , Jessica Gronsbell

A Scalable Entity-Based Framework for Auditing Bias in LLMs

Existing approaches to bias evaluation in large language models (LLMs) trade ecological validity for statistical control, relying either on artificial prompts that poorly reflect real-world use or on naturalistic tasks that lack scale and…

Computation and Language · Computer Science 2026-05-12 Akram Elbouanani , Aboubacar Tuo , Adrian Popescu

Explainability Fact Sheets: A Framework for Systematic Assessment of Explainable Approaches

Explanations in Machine Learning come in many forms, but a consensus regarding their desired properties is yet to emerge. In this paper we introduce a taxonomy and a set of descriptors that can be used to characterise and systematically…

Machine Learning · Computer Science 2019-12-12 Kacper Sokol , Peter Flach

Evaluating Explainability: A Framework for Systematic Assessment and Reporting of Explainable AI Features

Explainability features are intended to provide insight into the internal mechanisms of an AI device, but there is a lack of evaluation techniques for assessing the quality of provided explanations. We propose a framework to assess and…

Artificial Intelligence · Computer Science 2025-06-18 Miguel A. Lago , Ghada Zamzmi , Brandon Eich , Jana G. Delfino

Making Fair ML Software using Trustworthy Explanation

Machine learning software is being used in many applications (finance, hiring, admissions, criminal justice) having a huge social impact. But sometimes the behavior of this software is biased and it shows discrimination based on some…

Software Engineering · Computer Science 2020-08-31 Joymallya Chakraborty , Kewen Peng , Tim Menzies

The Alignment Auditor: A Bayesian Framework for Verifying and Refining LLM Objectives

The objectives that Large Language Models (LLMs) implicitly optimize remain dangerously opaque, making trustworthy alignment and auditing a grand challenge. While Inverse Reinforcement Learning (IRL) can infer reward functions from…

Machine Learning · Computer Science 2025-10-09 Matthieu Bou , Nyal Patel , Arjun Jagota , Satyapriya Krishna , Sonali Parbhoo

Explainability Auditing for Intelligent Systems: A Rationale for Multi-Disciplinary Perspectives

National and international guidelines for trustworthy artificial intelligence (AI) consider explainability to be a central facet of trustworthy systems. This paper outlines a multi-disciplinary rationale for explainability auditing.…

Computers and Society · Computer Science 2025-04-22 Markus Langer , Kevin Baum , Kathrin Hartmann , Stefan Hessel , Timo Speith , Jonas Wahl

Smart Audit System Empowered by LLM

Manufacturing quality audits are pivotal for ensuring high product standards in mass production environments. Traditional auditing processes, however, are labor-intensive and reliant on human expertise, posing challenges in maintaining…

Computation and Language · Computer Science 2024-10-11 Xu Yao , Xiaoxu Wu , Xi Li , Huan Xu , Chenlei Li , Ping Huang , Si Li , Xiaoning Ma , Jiulong Shan

Towards a multi-stakeholder value-based assessment framework for algorithmic systems

In an effort to regulate Machine Learning-driven (ML) systems, current auditing processes mostly focus on detecting harmful algorithmic biases. While these strategies have proven to be impactful, some values outlined in documents dealing…

Machine Learning · Computer Science 2022-06-20 Mireia Yurrita , Dave Murray-Rust , Agathe Balayn , Alessandro Bozzon

Assessing the Auditability of AI-integrating Systems: A Framework and Learning Analytics Case Study

Audits contribute to the trustworthiness of Learning Analytics (LA) systems that integrate Artificial Intelligence (AI) and may be legally required in the future. We argue that the efficacy of an audit depends on the auditability of the…

Computers and Society · Computer Science 2024-11-15 Linda Fernsel , Yannick Kalff , Katharina Simbeck

Achieving Transparency in Distributed Machine Learning with Explainable Data Collaboration

Transparency of Machine Learning models used for decision support in various industries becomes essential for ensuring their ethical use. To that end, feature attribution methods such as SHAP (SHapley Additive exPlanations) are widely used…

Machine Learning · Computer Science 2022-12-08 Anna Bogdanova , Akira Imakura , Tetsuya Sakurai , Tomoya Fujii , Teppei Sakamoto , Hiroyuki Abe

ExplainBench: A Benchmark Framework for Local Model Explanations in Fairness-Critical Applications

As machine learning systems are increasingly deployed in high-stakes domains such as criminal justice, finance, and healthcare, the demand for interpretable and trustworthy models has intensified. Despite the proliferation of local…

Machine Learning · Computer Science 2025-06-10 James Afful

The Road to Explainability is Paved with Bias: Measuring the Fairness of Explanations

Machine learning models in safety-critical settings like healthcare are often blackboxes: they contain a large number of parameters which are not transparent to users. Post-hoc explainability methods where a simple, human-interpretable…

Machine Learning · Computer Science 2022-06-03 Aparna Balagopalan , Haoran Zhang , Kimia Hamidieh , Thomas Hartvigsen , Frank Rudzicz , Marzyeh Ghassemi

An ExplainableFair Framework for Prediction of Substance Use Disorder Treatment Completion

Fairness of machine learning models in healthcare has drawn increasing attention from clinicians, researchers, and even at the highest level of government. On the other hand, the importance of developing and deploying interpretable or…

Machine Learning · Computer Science 2024-09-04 Mary M. Lucas , Xiaoyang Wang , Chia-Hsuan Chang , Christopher C. Yang , Jacqueline E. Braughton , Quyen M. Ngo

Flexible and Context-Specific AI Explainability: A Multidisciplinary Approach

The recent enthusiasm for artificial intelligence (AI) is due principally to advances in deep learning. Deep learning methods are remarkably accurate, but also opaque, which limits their potential use in safety-critical applications. To…

Computers and Society · Computer Science 2020-03-18 Valérie Beaudouin , Isabelle Bloch , David Bounie , Stéphan Clémençon , Florence d'Alché-Buc , James Eagan , Winston Maxwell , Pavlo Mozharovskyi , Jayneel Parekh