English

Attribute noise robust binary classification

Machine Learning 2019-11-20 v1 Machine Learning

Abstract

We consider the problem of learning linear classifiers when both features and labels are binary. In addition, the features are noisy, i.e., they could be flipped with an unknown probability. In Sy-De attribute noise model, where all features could be noisy together with same probability, we show that 00-11 loss (l01l_{0-1}) need not be robust but a popular surrogate, squared loss (lsql_{sq}) is. In Asy-In attribute noise model, we prove that l01l_{0-1} is robust for any distribution over 2 dimensional feature space. However, due to computational intractability of l01l_{0-1}, we resort to lsql_{sq} and observe that it need not be Asy-In noise robust. Our empirical results support Sy-De robustness of squared loss for low to moderate noise rates.

Keywords

Cite

@article{arxiv.1911.07875,
  title  = {Attribute noise robust binary classification},
  author = {Aditya Petety and Sandhya Tripathi and N Hemachandra},
  journal= {arXiv preprint arXiv:1911.07875},
  year   = {2019}
}

Comments

Accepted for Student Abstract presentation at AAAI2020

R2 v1 2026-06-23T12:19:47.439Z