English

Fair Submodular Cover

Machine Learning 2024-07-09 v1 Computers and Society Data Structures and Algorithms

Abstract

Submodular optimization is a fundamental problem with many applications in machine learning, often involving decision-making over datasets with sensitive attributes such as gender or age. In such settings, it is often desirable to produce a diverse solution set that is fairly distributed with respect to these attributes. Motivated by this, we initiate the study of Fair Submodular Cover (FSC), where given a ground set UU, a monotone submodular function f:2UR0f:2^U\to\mathbb{R}_{\ge 0}, a threshold τ\tau, the goal is to find a balanced subset of SS with minimum cardinality such that f(S)τf(S)\ge\tau. We first introduce discrete algorithms for FSC that achieve a bicriteria approximation ratio of (1ϵ,1O(ϵ))(\frac{1}{\epsilon}, 1-O(\epsilon)). We then present a continuous algorithm that achieves a (ln1ϵ,1O(ϵ))(\ln\frac{1}{\epsilon}, 1-O(\epsilon))-bicriteria approximation ratio, which matches the best approximation guarantee of submodular cover without a fairness constraint. Finally, we complement our theoretical results with a number of empirical evaluations that demonstrate the effectiveness of our algorithms on instances of maximum coverage.

Keywords

Cite

@article{arxiv.2407.04804,
  title  = {Fair Submodular Cover},
  author = {Wenjing Chen and Shuo Xing and Samson Zhou and Victoria G. Crawford},
  journal= {arXiv preprint arXiv:2407.04804},
  year   = {2024}
}