Related papers: Distributed Sparse Multicategory Discriminant Anal…

Multiclass Sparse Discriminant Analysis

In recent years many sparse linear discriminant analysis methods have been proposed for high-dimensional classification and variable selection. However, most of these proposals focus on binary classification and they are not directly…

Methodology · Statistics 2015-04-23 Qing Mai, Yi Yang, Hui Zou

A convex formulation for high-dimensional sparse sliced inverse regression

Sliced inverse regression is a popular tool for sufficient dimension reduction, which replaces covariates with a minimal set of their linear combinations without loss of information on the conditional distribution of the response given the…

Machine Learning · Statistics 2018-09-18 Kean Ming Tan, Zhaoran Wang, Tong Zhang, Han Liu, R. Dennis Cook

Prediction and estimation consistency of sparse multi-class penalized optimal scoring

Sparse linear discriminant analysis via penalized optimal scoring is a successful tool for classification in high-dimensional settings. While the variable selection consistency of sparse optimal scoring has been established, the…

Statistics Theory · Mathematics 2021-04-01 Irina Gaynanova

Sparse Distributed Learning Based on Diffusion Adaptation

This article proposes diffusion LMS strategies for distributed estimation over adaptive networks that are able to exploit sparsity in the underlying system model. The approach relies on convex regularization, common in compressive sensing,…

Machine Learning · Computer Science 2015-06-05 Paolo Di Lorenzo, Ali H. Sayed

Sparse linear discriminant analysis by thresholding for high dimensional data

In many social, economical, biological and medical studies, one objective is to classify a subject into one of several classes based on a set of variables observed from the subject. Because the probability distribution of the variables is…

Statistics Theory · Mathematics 2011-05-19 Jun Shao, Yazhen Wang, Xinwei Deng, Sijian Wang

Distributed Sparse Linear Regression under Communication Constraints

In multiple domains, statistical tasks are performed in distributed settings, with data split among several end machines that are connected to a fusion center. In various applications, the end machines have limited bandwidth and power, and…

Machine Learning · Computer Science 2026-01-05 Rodney Fonseca, Boaz Nadler

Sparse Multivariate Factor Regression

We consider the problem of multivariate regression in a setting where the relevant predictors could be shared among different responses. We propose an algorithm which decomposes the coefficient matrix into the product of a long matrix and a…

Machine Learning · Statistics 2016-03-02 Milad Kharratzadeh, Mark Coates

Improving Group Lasso for high-dimensional categorical data

Sparse modelling or model selection with categorical data is challenging even for a moderate number of variables, because one parameter is roughly needed to encode one category or level. The Group Lasso is a well known efficient algorithm…

Methodology · Statistics 2022-11-14 Szymon Nowakowski, Piotr Pokarowski, Wojciech Rejchel, Agnieszka Sołtys

Exploring Sparsity in Multi-class Linear Discriminant Analysis

Recent studies in the literature have paid much attention to the sparsity in linear classification tasks. One motivation of imposing sparsity assumption on the linear discriminant direction is to rule out the noninformative features, making…

Machine Learning · Statistics 2015-01-13 Dong Xia

A supervised discriminant data representation: application to pattern classification

The performance of machine learning and pattern recognition algorithms generally depends on data representation. That is why much of the current effort in performing machine learning algorithms goes into the design of preprocessing…

Machine Learning · Computer Science 2025-10-28 Fadi Dornaika, Ahmad Khoder, Abdelmalik Moujahid, Wassim Khoder

Distributed Linear Regression with Compositional Covariates

With the availability of extraordinarily huge data sets, solving the problems of distributed statistical methodology and computing for such data sets has become increasingly crucial in the big data area. In this paper, we focus on the…

Machine Learning · Statistics 2023-10-24 Yue Chao, Lei Huang, Xuejun Ma

Collaborative Regression

We consider the scenario where one observes an outcome variable and sets of features from multiple assays, all measured on the same set of samples. One approach that has been proposed for dealing with this type of data is "sparse multiple…

Quantitative Methods · Quantitative Biology 2014-01-24 Samuel M. Gross, Robert Tibshirani

Robust Discriminative Clustering with Sparse Regularizers

Clustering high-dimensional data often requires some form of dimensionality reduction, where clustered variables are separated from "noise-looking" variables. We cast this problem as finding a low-dimensional projection of the data which is…

Machine Learning · Statistics 2016-08-30 Nicolas Flammarion, Balamurugan Palaniappan, Francis Bach

Distributed Learning with Sparse Communications by Identification

In distributed optimization for large-scale learning, a major performance limitation comes from the communications between the different entities. When computations are performed by workers on local data while a coordinator machine…

Optimization and Control · Mathematics 2020-06-26 Dmitry Grishchenko, Franck Iutzeler, Jérôme Malick, Massih-Reza Amini

Distributed Nonconvex Optimization for Sparse Representation

We consider a non-convex constrained Lagrangian formulation of a fundamental bi-criteria optimization problem for variable selection in statistical learning; the two criteria are a smooth (possibly) nonconvex loss function, measuring the…

Optimization and Control · Mathematics 2016-11-22 Ying Sun, Gesualdo Scutari

High-Dimensional Distributed Sparse Classification with Scalable Communication-Efficient Global Updates

As the size of datasets used in statistical learning continues to grow, distributed training of models has attracted increasing attention. These methods partition the data and exploit parallelism to reduce memory and runtime, but suffer…

Machine Learning · Computer Science 2024-07-10 Fred Lu, Ryan R. Curtin, Edward Raff, Francis Ferraro, James Holt

Learning Discriminative Multilevel Structured Dictionaries for Supervised Image Classification

Sparse representations using overcomplete dictionaries have proved to be a powerful tool in many signal processing applications such as denoising, super-resolution, inpainting, compression or classification. The sparsity of the…

Machine Learning · Statistics 2018-03-01 Jeremy Aghaei Mazaheri, Elif Vural, Claude Labit, Christine Guillemot

Discriminative Sparse Coding on Multi-Manifold for Data Representation and Classification

Sparse coding has been popularly used as an effective data representation method in various applications, such as computer vision, medical imaging and bioinformatics, etc. However, the conventional sparse coding algorithms and its manifold…

Computer Vision and Pattern Recognition · Computer Science 2013-04-04 Jing-Yan Wang

Communication-efficient sparse regression: a one-shot approach

We devise a one-shot approach to distributed sparse regression in the high-dimensional setting. The key idea is to average "debiased" or "desparsified" lasso estimators. We show the approach converges at the same rate as the lasso as long…

Machine Learning · Statistics 2015-08-12 Jason D. Lee, Yuekai Sun, Qiang Liu, Jonathan E. Taylor

Communication-efficient Distributed Sparse Linear Discriminant Analysis

We propose a communication-efficient distributed estimation method for sparse linear discriminant analysis (LDA) in the high dimensional regime. Our method distributes the data of size $N$ into $m$ machines, and estimates a local sparse LDA…

Machine Learning · Statistics 2016-10-18 Lu Tian, Quanquan Gu