Proxy Non-Discrimination in Data-Driven Systems

Anupam Datta; Matt Fredrikson; Gihyuk Ko; Piotr Mardziel; Shayak Sen

Proxy Non-Discrimination in Data-Driven Systems

Computers and Society 2018-03-22 v1 Machine Learning

Authors: Anupam Datta , Matt Fredrikson , Gihyuk Ko , Piotr Mardziel , Shayak Sen

Abstract

Machine learnt systems inherit biases against protected classes, historically disparaged groups, from training data. Usually, these biases are not explicit, they rely on subtle correlations discovered by training algorithms, and are therefore difficult to detect. We formalize proxy discrimination in data-driven systems, a class of properties indicative of bias, as the presence of protected class correlates that have causal influence on the system's output. We evaluate an implementation on a corpus of social datasets, demonstrating how to validate systems against these properties and to repair violations where they occur.

Keywords

fairness in machine learning machine learning theory machine learning

Cite

@article{arxiv.1707.08120,
  title  = {Proxy Non-Discrimination in Data-Driven Systems},
  author = {Anupam Datta and Matt Fredrikson and Gihyuk Ko and Piotr Mardziel and Shayak Sen},
  journal= {arXiv preprint arXiv:1707.08120},
  year   = {2018}
}

Comments

arXiv admin note: substantial text overlap with arXiv:1705.07807

Proxy Non-Discrimination in Data-Driven Systems

Abstract

Keywords

Cite

Comments

Related papers