Related papers: Specification Overfitting in Artificial Intelligen…

The Problem with Metrics is a Fundamental Problem for AI

Optimizing a given metric is a central aspect of most current AI approaches, yet overemphasizing metrics leads to manipulation, gaming, a myopic focus on short-term goals, and other unexpected negative consequences. This poses a fundamental…

Computers and Society · Computer Science 2020-02-21 Rachel Thomas , David Uminsky

Critical Appraisal of Fairness Metrics in Clinical Predictive AI

Predictive artificial intelligence (AI) offers an opportunity to improve clinical practice and patient outcomes, but risks perpetuating biases if fairness is inadequately addressed. However, the definition of "fairness" remains unclear. We…

Machine Learning · Computer Science 2025-06-23 João Matos , Ben Van Calster , Leo Anthony Celi , Paula Dhiman , Judy Wawira Gichoya , Richard D. Riley , Chris Russell , Sara Khalid , Gary S. Collins

Target specification bias, counterfactual prediction, and algorithmic fairness in healthcare

Bias in applications of machine learning (ML) to healthcare is usually attributed to unrepresentative or incomplete data, or to underlying health disparities. This article identifies a more pervasive source of bias that affects the clinical…

Machine Learning · Computer Science 2023-08-07 Eran Tal

Towards Fairness Certification in Artificial Intelligence

Thanks to the great progress of machine learning in the last years, several Artificial Intelligence (AI) techniques have been increasingly moving from the controlled research laboratory settings to our everyday life. AI is clearly…

Artificial Intelligence · Computer Science 2021-06-07 Tatiana Tommasi , Silvia Bucci , Barbara Caputo , Pietro Asinari

Inherent Limitations of AI Fairness

As the real-world impact of Artificial Intelligence (AI) systems has been steadily growing, so too have these systems come under increasing scrutiny. In response, the study of AI fairness has rapidly developed into a rich field of research…

Computers and Society · Computer Science 2023-09-19 Maarten Buyl , Tijl De Bie

Enhancing Formal Software Specification with Artificial Intelligence

Formal software specification is known to enable early error detection and explicit invariants, yet it has seen limited industrial adoption due to its high notation overhead and the expertise required to use traditional formal languages.…

Software Engineering · Computer Science 2026-01-16 Antonio Abu Nassar , Eitan Farchi

Algorithmic Fairness: Not a Purely Technical but Socio-Technical Property

The rapid trend of deploying artificial intelligence (AI) and machine learning (ML) systems in socially consequential domains has raised growing concerns about their trustworthiness, including potential discriminatory behaviours. Research…

Machine Learning · Computer Science 2025-09-22 Yijun Bian , Lei You , Yuya Sasaki , Haruka Maeda , Akira Igarashi

A Gray Literature Study on Fairness Requirements in AI-enabled Software Engineering

Today, with the growing obsession with applying Artificial Intelligence (AI), particularly Machine Learning (ML), to software across various contexts, much of the focus has been on the effectiveness of AI models, often measured through…

Software Engineering · Computer Science 2025-12-10 Thanh Nguyen , Chaima Boufaied , Ronnie de Souza Santos

Getting Fairness Right: Towards a Toolbox for Practitioners

The potential risk of AI systems unintentionally embedding and reproducing bias has attracted the attention of machine learning practitioners and society at large. As policy makers are willing to set the standards of algorithms and AI…

Artificial Intelligence · Computer Science 2020-03-17 Boris Ruf , Chaouki Boutharouite , Marcin Detyniecki

Safety by Measurement: A Systematic Literature Review of AI Safety Evaluation Methods

As frontier AI systems advance toward transformative capabilities, we need a parallel transformation in how we measure and evaluate these systems to ensure safety and inform governance. While benchmarks have been the primary method for…

Artificial Intelligence · Computer Science 2025-05-12 Markov Grey , Charbel-Raphaël Segerie

Inadequacies of Large Language Model Benchmarks in the Era of Generative Artificial Intelligence

The rapid rise in popularity of Large Language Models (LLMs) with emerging capabilities has spurred public curiosity to evaluate and compare different LLMs, leading many researchers to propose their own LLM benchmarks. Noticing preliminary…

Artificial Intelligence · Computer Science 2025-05-15 Timothy R. McIntosh , Teo Susnjak , Nalin Arachchilage , Tong Liu , Paul Watters , Malka N. Halgamuge

An Artificial Intelligence Value at Risk Approach: Metrics and Models

Artificial intelligence risks are multidimensional in nature, as the same risk scenarios may have legal, operational, and financial risk dimensions. With the emergence of new AI regulations, the state of the art of artificial intelligence…

Computers and Society · Computer Science 2025-09-24 Luis Enriquez Alvarez

(Unfair) Norms in Fairness Research: A Meta-Analysis

Algorithmic fairness has emerged as a critical concern in artificial intelligence (AI) research. However, the development of fair AI systems is not an objective process. Fairness is an inherently subjective concept, shaped by the values,…

Computers and Society · Computer Science 2024-07-25 Jennifer Chien , A. Stevie Bergman , Kevin R. McKee , Nenad Tomasev , Vinodkumar Prabhakaran , Rida Qadri , Nahema Marchal , William Isaac

Fairness-aware Configuration of Machine Learning Libraries

This paper investigates the parameter space of machine learning (ML) algorithms in aggravating or mitigating fairness bugs. Data-driven software is increasingly applied in social-critical applications where ensuring fairness is of paramount…

Software Engineering · Computer Science 2022-02-15 Saeid Tizpaz-Niari , Ashish Kumar , Gang Tan , Ashutosh Trivedi

Revisiting Technical Bias Mitigation Strategies

Efforts to mitigate bias and enhance fairness in the artificial intelligence (AI) community have predominantly focused on technical solutions. While numerous reviews have addressed bias in AI, this review uniquely focuses on the practical…

Artificial Intelligence · Computer Science 2024-10-24 Abdoul Jalil Djiberou Mahamadou , Artem A. Trotsyuk

Bias and unfairness in machine learning models: a systematic literature review

One of the difficulties of artificial intelligence is to ensure that model decisions are fair and free of bias. In research, datasets, metrics, techniques, and tools are applied to detect and mitigate algorithmic unfairness and bias. This…

Machine Learning · Computer Science 2022-11-04 Tiago Palma Pagano , Rafael Bessa Loureiro , Fernanda Vitória Nascimento Lisboa , Gustavo Oliveira Ramos Cruz , Rodrigo Matos Peixoto , Guilherme Aragão de Sousa Guimarães , Lucas Lisboa dos Santos , Maira Matos Araujo , Marco Cruz , Ewerton Lopes Silva de Oliveira , Ingrid Winkler , Erick Giovani Sperandio Nascimento

Can We Trust AI Benchmarks? An Interdisciplinary Review of Current Issues in AI Evaluation

Quantitative Artificial Intelligence (AI) Benchmarks have emerged as fundamental tools for evaluating the performance, capability, and safety of AI models and systems. Currently, they shape the direction of AI development and are playing an…

Artificial Intelligence · Computer Science 2025-05-27 Maria Eriksson , Erasmo Purificato , Arman Noroozian , Joao Vinagre , Guillaume Chaslot , Emilia Gomez , David Fernandez-Llorca

A Framework for Fairness: A Systematic Review of Existing Fair AI Solutions

In a world of daily emerging scientific inquisition and discovery, the prolific launch of machine learning across industries comes to little surprise for those familiar with the potential of ML. Neither so should the congruent expansion of…

Artificial Intelligence · Computer Science 2021-12-13 Brianna Richardson , Juan E. Gilbert

Goal Misgeneralization: Why Correct Specifications Aren't Enough For Correct Goals

The field of AI alignment is concerned with AI systems that pursue unintended goals. One commonly studied mechanism by which an unintended goal might arise is specification gaming, in which the designer-provided specification is flawed in a…

Machine Learning · Computer Science 2022-11-03 Rohin Shah , Vikrant Varma , Ramana Kumar , Mary Phuong , Victoria Krakovna , Jonathan Uesato , Zac Kenton

Joint Optimization of AI Fairness and Utility: A Human-Centered Approach

Today, AI is increasingly being used in many high-stakes decision-making applications in which fairness is an important concern. Already, there are many examples of AI being biased and making questionable and unfair decisions. The AI…

Artificial Intelligence · Computer Science 2020-02-06 Yunfeng Zhang , Rachel K. E. Bellamy , Kush R. Varshney