Related papers: Counterfactually Fair Reinforcement Learning via S…

Learning impartial policies for sequential counterfactual explanations using Deep Reinforcement Learning

In the field of explainable Artificial Intelligence (XAI), sequential counterfactual (SCF) examples are often used to alter the decision of a trained classifier by implementing a sequence of modifications to the input instance. Although…

Machine Learning · Computer Science 2023-11-02 E. Panagiotou, E. Ntoutsi

Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation

Reinforcement learning (RL) algorithms usually require a substantial amount of interaction data and perform well only for specific tasks in a fixed environment. In some scenarios such as healthcare, however, usually only few records are…

Machine Learning · Computer Science 2020-12-17 Chaochao Lu, Biwei Huang, Ke Wang, José Miguel Hernández-Lobato, Kun Zhang, Bernhard Schölkopf

PyCFRL: A Python library for counterfactually fair offline reinforcement learning via sequential data preprocessing

Reinforcement learning (RL) aims to learn and evaluate a sequential decision rule, often referred to as a "policy", that maximizes the population-level benefit in an environment across possibly infinitely many time steps. However, the…

Machine Learning · Statistics 2025-10-09 Jianhan Zhang, Jitao Wang, Chengchun Shi, John D. Piette, Donglin Zeng, Zhenke Wu

Counterfactually Fair Representation

The use of machine learning models in high-stake applications (e.g., healthcare, lending, college admission) has raised growing concerns due to potential biases against protected social groups. Various fairness notions and methods have been…

Machine Learning · Computer Science 2023-11-10 Zhiqun Zuo, Mohammad Mahdi Khalili, Xueru Zhang

Survey on Fair Reinforcement Learning: Theory and Practice

Fairness-aware learning aims at satisfying various fairness constraints in addition to the usual performance criteria via data-driven machine learning techniques. Most of the research in fairness-aware learning employs the setting of…

Machine Learning · Computer Science 2022-05-23 Pratik Gajane, Akrati Saxena, Maryam Tavakol, George Fletcher, Mykola Pechenizkiy

Reinforcement Learning with Stepwise Fairness Constraints

AI methods are used in societally important settings, ranging from credit to employment to housing, and it is crucial to provide fairness in regard to algorithmic decision making. Moreover, many settings are dynamic, with populations…

Machine Learning · Computer Science 2022-11-09 Zhun Deng, He Sun, Zhiwei Steven Wu, Linjun Zhang, David C. Parkes

Counterfactually Safe Reinforcement Learning

Reinforcement learning algorithms are generally designed to maximize the expected return across a population. However, a policy that is optimal on average may be suboptimal for certain individuals, leading to potential safety concerns. To…

Machine Learning · Statistics 2026-05-26 Jingyi Li, Peng Wu, Chengchun Shi

Counterfactual Fairness by Combining Factual and Counterfactual Predictions

In high-stake domains such as healthcare and hiring, the role of machine learning (ML) in decision-making raises significant fairness concerns. This work focuses on Counterfactual Fairness (CF), which posits that an ML model's outcome on…

Machine Learning · Computer Science 2025-01-23 Zeyu Zhou, Tianci Liu, Ruqi Bai, Jing Gao, Murat Kocaoglu, David I. Inouye

Counterfactual Explanations for Continuous Action Reinforcement Learning

Reinforcement Learning (RL) has shown great promise in domains like healthcare and robotics but often struggles with adoption due to its lack of interpretability. Counterfactual explanations, which address "what if" scenarios, provide a…

Machine Learning · Computer Science 2025-05-20 Shuyang Dong, Shangtong Zhang, Lu Feng

On Learning and Testing of Counterfactual Fairness through Data Preprocessing

Machine learning has become more important in real-life decision-making but people are concerned about the ethical problems it may bring when used improperly. Recent work brings the discussion of machine learning fairness into the causal…

Machine Learning · Statistics 2022-02-28 Haoyu Chen, Wenbin Lu, Rui Song, Pulak Ghosh

Lookahead Counterfactual Fairness

As machine learning (ML) algorithms are used in applications that involve humans, concerns have arisen that these algorithms may be biased against certain social groups. Counterfactual fairness (CF) is a fairness notion proposed in…

Machine Learning · Computer Science 2024-12-03 Zhiqun Zuo, Tian Xie, Xuwei Tan, Xueru Zhang, Mohammad Mahdi Khalili

Counterfactual Fairness

Machine learning can impact people with legal or ethical consequences when it is used to automate decisions in areas such as insurance, lending, hiring, and predictive policing. In many of these scenarios, previous decisions have been made…

Machine Learning · Statistics 2018-03-09 Matt J. Kusner, Joshua R. Loftus, Chris Russell, Ricardo Silva

Counterfactual Explainer Framework for Deep Reinforcement Learning Models Using Policy Distillation

Deep Reinforcement Learning (DRL) has demonstrated promising capability in solving complex control problems. However, DRL applications in safety-critical systems are hindered by the inherent lack of robust verification techniques to assure…

Machine Learning · Computer Science 2023-10-10 Amir Samadi, Konstantinos Koufos, Kurt Debattista, Mehrdad Dianati

Competitive Multi-Agent Deep Reinforcement Learning with Counterfactual Thinking

Counterfactual thinking describes a psychological phenomenon that people re-infer the possible results with different solutions about things that have already happened. It helps people to gain more experience from mistakes and thus to…

Machine Learning · Computer Science 2019-08-19 Yue Wang, Yao Wan, Chenwei Zhang, Lixin Cui, Lu Bai, Philip S. Yu

Counterfactual Reasoning for Fair Clinical Risk Prediction

The use of machine learning systems to support decision making in healthcare raises questions as to what extent these systems may introduce or exacerbate disparities in care for historically underrepresented and mistreated groups, due to…

Machine Learning · Computer Science 2019-07-16 Stephen Pfohl, Tony Duan, Daisy Yi Ding, Nigam H. Shah

Counterfactual Explanation Policies in RL

As Reinforcement Learning (RL) agents are increasingly employed in diverse decision-making problems using reward preferences, it becomes important to ensure that policies learned by these frameworks in mapping observations to a probability…

Artificial Intelligence · Computer Science 2023-07-26 Shripad V. Deshmukh, Srivatsan R, Supriti Vijay, Jayakumar Subramanian, Chirag Agarwal

Clinician-in-the-Loop Decision Making: Reinforcement Learning with Near-Optimal Set-Valued Policies

Standard reinforcement learning (RL) aims to find an optimal policy that identifies the best action for each state. However, in healthcare settings, many actions may be near-equivalent with respect to the reward (e.g., survival). We…

Machine Learning · Computer Science 2020-07-27 Shengpu Tang, Aditya Modi, Michael W. Sjoding, Jenna Wiens

ReLAX: Reinforcement Learning Agent eXplainer for Arbitrary Predictive Models

Counterfactual examples (CFs) are one of the most popular methods for attaching post-hoc explanations to machine learning (ML) models. However, existing CF generation methods either exploit the internals of specific models or depend on each…

Machine Learning · Computer Science 2023-08-10 Ziheng Chen, Fabrizio Silvestri, Jia Wang, He Zhu, Hongshik Ahn, Gabriele Tolomei

Counterfactual Reward Model Training for Bias Mitigation in Multimodal Reinforcement Learning

In reinforcement learning with human feedback (RLHF), reward models can efficiently learn and amplify latent biases within multimodal datasets, which can lead to imperfect policy optimization through flawed reward signals and decreased…

Machine Learning · Computer Science 2025-08-28 Sheryl Mathew, N Harshit

Representation Matters: Offline Pretraining for Sequential Decision Making

The recent success of supervised learning methods on ever larger offline datasets has spurred interest in the reinforcement learning (RL) field to investigate whether the same paradigms can be translated to RL algorithms. This research…

Machine Learning · Computer Science 2021-02-12 Mengjiao Yang, Ofir Nachum