c-lasso -- a Python package for constrained sparse and robust regression and classification
Abstract
We introduce c-lasso, a Python package that enables sparse and robust linear regression and classification with linear equality constraints. The underlying statistical forward model is assumed to be of the following form: Here, is a given design matrix and the vector is a continuous or binary response vector. The matrix is a general constraint matrix. The vector contains the unknown coefficients and an unknown scale. Prominent use cases are (sparse) log-contrast regression with compositional data , requiring the constraint (Aitchion and Bacon-Shone 1984) and the Generalized Lasso which is a special case of the described problem (see, e.g, (James, Paulson, and Rusmevichientong 2020), Example 3). The c-lasso package provides estimators for inferring unknown coefficients and scale (i.e., perspective M-estimators (Combettes and M\"uller 2020a)) of the form for several convex loss functions . This includes the constrained Lasso, the constrained scaled Lasso, and sparse Huber M-estimators with linear equality constraints.
Cite
@article{arxiv.2011.00898,
title = {c-lasso -- a Python package for constrained sparse and robust regression and classification},
author = {Léo Simpson and Patrick L. Combettes and Christian L. Müller},
journal= {arXiv preprint arXiv:2011.00898},
year = {2020}
}