Superquantile-based learning: a direct approach using gradient-based optimization

Yassine Laguel; Jérôme Malick; Zaid Harchaoui

Superquantile-based learning: a direct approach using gradient-based optimization

Optimization and Control 2022-01-04 v1

Authors: Yassine Laguel , Jérôme Malick , Zaid Harchaoui

Abstract

We consider a formulation of supervised learning that endows models with robustness to distributional shifts from training to testing. The formulation hinges upon the superquantile risk measure, also known as the conditional value-at-risk, which has shown promise in recent applications of machine learning and signal processing. We show that, thanks to a direct smoothing of the superquantile function, a superquantile-based learning objective is amenable to gradient-based optimization, using batch optimization algorithms such as gradient descent or quasi-Newton algorithms, or using stochastic optimization algorithms such as stochastic gradient algorithms. A companion software SPQR implements in Python the algorithms described and allows practitioners to experiment with superquantile-based supervised learning.

Keywords

gradient descent optimization neural networks stochastic optimization

Cite

@article{arxiv.2201.00505,
  title  = {Superquantile-based learning: a direct approach using gradient-based optimization},
  author = {Yassine Laguel and Jérôme Malick and Zaid Harchaoui},
  journal= {arXiv preprint arXiv:2201.00505},
  year   = {2022}
}

Superquantile-based learning: a direct approach using gradient-based optimization

Abstract

Keywords

Cite

Related papers