English
Related papers

Related papers: Counterfactually Fair Reinforcement Learning via S…

200 papers

In the field of explainable Artificial Intelligence (XAI), sequential counterfactual (SCF) examples are often used to alter the decision of a trained classifier by implementing a sequence of modifications to the input instance. Although…

Machine Learning · Computer Science 2023-11-02 E. Panagiotou , E. Ntoutsi

Reinforcement learning (RL) algorithms usually require a substantial amount of interaction data and perform well only for specific tasks in a fixed environment. In some scenarios such as healthcare, however, usually only few records are…

Machine Learning · Computer Science 2020-12-17 Chaochao Lu , Biwei Huang , Ke Wang , José Miguel Hernández-Lobato , Kun Zhang , Bernhard Schölkopf

Reinforcement learning (RL) aims to learn and evaluate a sequential decision rule, often referred to as a "policy", that maximizes the population-level benefit in an environment across possibly infinitely many time steps. However, the…

Machine Learning · Statistics 2025-10-09 Jianhan Zhang , Jitao Wang , Chengchun Shi , John D. Piette , Donglin Zeng , Zhenke Wu

The use of machine learning models in high-stake applications (e.g., healthcare, lending, college admission) has raised growing concerns due to potential biases against protected social groups. Various fairness notions and methods have been…

Machine Learning · Computer Science 2023-11-10 Zhiqun Zuo , Mohammad Mahdi Khalili , Xueru Zhang

Fairness-aware learning aims at satisfying various fairness constraints in addition to the usual performance criteria via data-driven machine learning techniques. Most of the research in fairness-aware learning employs the setting of…

Machine Learning · Computer Science 2022-05-23 Pratik Gajane , Akrati Saxena , Maryam Tavakol , George Fletcher , Mykola Pechenizkiy

AI methods are used in societally important settings, ranging from credit to employment to housing, and it is crucial to provide fairness in regard to algorithmic decision making. Moreover, many settings are dynamic, with populations…

Machine Learning · Computer Science 2022-11-09 Zhun Deng , He Sun , Zhiwei Steven Wu , Linjun Zhang , David C. Parkes

Reinforcement learning algorithms are generally designed to maximize the expected return across a population. However, a policy that is optimal on average may be suboptimal for certain individuals, leading to potential safety concerns. To…

Machine Learning · Statistics 2026-05-26 Jingyi Li , Peng Wu , Chengchun Shi

In high-stake domains such as healthcare and hiring, the role of machine learning (ML) in decision-making raises significant fairness concerns. This work focuses on Counterfactual Fairness (CF), which posits that an ML model's outcome on…

Machine Learning · Computer Science 2025-01-23 Zeyu Zhou , Tianci Liu , Ruqi Bai , Jing Gao , Murat Kocaoglu , David I. Inouye

Reinforcement Learning (RL) has shown great promise in domains like healthcare and robotics but often struggles with adoption due to its lack of interpretability. Counterfactual explanations, which address "what if" scenarios, provide a…

Machine Learning · Computer Science 2025-05-20 Shuyang Dong , Shangtong Zhang , Lu Feng

Machine learning has become more important in real-life decision-making but people are concerned about the ethical problems it may bring when used improperly. Recent work brings the discussion of machine learning fairness into the causal…

Machine Learning · Statistics 2022-02-28 Haoyu Chen , Wenbin Lu , Rui Song , Pulak Ghosh

As machine learning (ML) algorithms are used in applications that involve humans, concerns have arisen that these algorithms may be biased against certain social groups. \textit{Counterfactual fairness} (CF) is a fairness notion proposed in…

Machine Learning · Computer Science 2024-12-03 Zhiqun Zuo , Tian Xie , Xuwei Tan , Xueru Zhang , Mohammad Mahdi Khalili

Machine learning can impact people with legal or ethical consequences when it is used to automate decisions in areas such as insurance, lending, hiring, and predictive policing. In many of these scenarios, previous decisions have been made…

Machine Learning · Statistics 2018-03-09 Matt J. Kusner , Joshua R. Loftus , Chris Russell , Ricardo Silva

Deep Reinforcement Learning (DRL) has demonstrated promising capability in solving complex control problems. However, DRL applications in safety-critical systems are hindered by the inherent lack of robust verification techniques to assure…

Machine Learning · Computer Science 2023-10-10 Amir Samadi , Konstantinos Koufos , Kurt Debattista , Mehrdad Dianati

Counterfactual thinking describes a psychological phenomenon that people re-infer the possible results with different solutions about things that have already happened. It helps people to gain more experience from mistakes and thus to…

Machine Learning · Computer Science 2019-08-19 Yue Wang , Yao Wan , Chenwei Zhang , Lixin Cui , Lu Bai , Philip S. Yu

The use of machine learning systems to support decision making in healthcare raises questions as to what extent these systems may introduce or exacerbate disparities in care for historically underrepresented and mistreated groups, due to…

Machine Learning · Computer Science 2019-07-16 Stephen Pfohl , Tony Duan , Daisy Yi Ding , Nigam H. Shah

As Reinforcement Learning (RL) agents are increasingly employed in diverse decision-making problems using reward preferences, it becomes important to ensure that policies learned by these frameworks in mapping observations to a probability…

Artificial Intelligence · Computer Science 2023-07-26 Shripad V. Deshmukh , Srivatsan R , Supriti Vijay , Jayakumar Subramanian , Chirag Agarwal

Standard reinforcement learning (RL) aims to find an optimal policy that identifies the best action for each state. However, in healthcare settings, many actions may be near-equivalent with respect to the reward (e.g., survival). We…

Machine Learning · Computer Science 2020-07-27 Shengpu Tang , Aditya Modi , Michael W. Sjoding , Jenna Wiens

Counterfactual examples (CFs) are one of the most popular methods for attaching post-hoc explanations to machine learning (ML) models. However, existing CF generation methods either exploit the internals of specific models or depend on each…

Machine Learning · Computer Science 2023-08-10 Ziheng Chen , Fabrizio Silvestri , Jia Wang , He Zhu , Hongshik Ahn , Gabriele Tolomei

In reinforcement learning with human feedback (RLHF), reward models can efficiently learn and amplify latent biases within multimodal datasets, which can lead to imperfect policy optimization through flawed reward signals and decreased…

Machine Learning · Computer Science 2025-08-28 Sheryl Mathew , N Harshit

The recent success of supervised learning methods on ever larger offline datasets has spurred interest in the reinforcement learning (RL) field to investigate whether the same paradigms can be translated to RL algorithms. This research…

Machine Learning · Computer Science 2021-02-12 Mengjiao Yang , Ofir Nachum
‹ Prev 1 2 3 10 Next ›