Related papers: New developments in Sparse PLS regression

A new Universal Resample Stable Bootstrap-based Stopping Criterion in PLS Components Construction

We develop a new robust stopping criterion in Partial Least Squares Regressions (PLSR) components construction characterised by a high level of stability. This new criterion is defined as a universal one since it is suitable both for PLSR…

Methodology · Statistics 2021-08-17 Jérémy Magnanensi , Frédéric Bertrand , Myriam Maumy-Bertrand , Nicolas Meyer

Generative Flexible Latent Structure Regression (GFLSR) model

Latent structure methods, specifically linear continuous latent structure methods, are a type of fundamental statistical learning strategy. They are widely used for dimension reduction, regression and prediction, in the fields of…

Methodology · Statistics 2025-08-07 Clara Grazian , Qian Jin , Pierre Lafaye De Micheaux

High Dimensional Classification with combined Adaptive Sparse PLS and Logistic Regression

Motivation: The high dimensionality of genomic data calls for the development of specific classification methodologies, especially to prevent over-optimistic predictions. This challenge can be tackled by compression and variable selection,…

Methodology · Statistics 2021-04-10 G. Durif , L. Modolo , J. Michaelsson , J. E. Mold , S. Lambert-Lacroix , F. Picard

Jointly Sparse Global SIMPLS Regression

Partial least squares (PLS) regression combines dimensionality reduction and prediction using a latent variable model. Since partial least squares regression (PLS-R) does not require matrix inversion or diagonalization, it can be applied to…

Methodology · Statistics 2014-08-05 Tzu-Yu Liu , Laura Trinchera , Arthur Tenenhaus , Dennis Wei , Alfred O. Hero

Dual-sPLS: a family of Dual Sparse Partial Least Squares regressions for feature selection and prediction with tunable sparsity; evaluation on simulated and near-infrared (NIR) data

Relating a set of variables X to a response y is crucial in chemometrics. A quantitative prediction objective can be enriched by qualitative data interpretation, for instance by locating the most influential features. When high-dimensional…

Machine Learning · Statistics 2023-04-21 Louna Alsouki , Laurent Duval , Clément Marteau , Rami El Haddad , François Wahl

A non-asymptotic analysis of the single component PLS regression

This paper investigates some theoretical properties of the Partial Least Square (PLS) method. We focus our attention on the single component case, that provides a useful framework to understand the underlying mechanism. We provide a…

Statistics Theory · Mathematics 2023-10-17 Luca Castelli , Clément Marteau , Irène Gannaz

Integrative Sparse Partial Least Squares

Partial least squares, as a dimension reduction method, has become increasingly important for its ability to deal with problems with a large number of variables. Since noisy variables may weaken the performance of the model, the sparse…

Methodology · Statistics 2020-06-08 Weijuan Liang , Shuangge Ma , Qingzhao Zhang , Tingyu Zhu

PLS: a new statistical insight through the prism of orthogonal polynomials

Partial Least Square (PLS) is a dimension reduction method used to remove multicollinearities in a regression model. However contrary to Principal Components Analysis (PCA) the PLS components are also choosen to be optimal for predicting…

Statistics Theory · Mathematics 2014-05-26 Mélanie Blazère , Fabrice Gamboa , Jean-Michel Loubes

Variable selection for Gaussian process regression through a sparse projection

This paper presents a new variable selection approach integrated with Gaussian process (GP) regression. We consider a sparse projection of input variables and a general stationary covariance model that depends on the Euclidean distance…

Machine Learning · Computer Science 2020-08-26 Chiwoo Park , David J. Borth , Nicholas S. Wilson , Chad N. Hunter

Sparse Techniques for Regression in Deep Gaussian Processes

Gaussian processes (GPs) have gained popularity as flexible machine learning models for regression and function approximation with an in-built method for uncertainty quantification. However, GPs suffer when the amount of training data is…

Machine Learning · Statistics 2025-11-26 Jonas Latz , Aretha L. Teckentrup , Simon Urbainczyk

The Group Square-Root Lasso: Theoretical Properties and Fast Algorithms

We introduce and study the Group Square-Root Lasso (GSRL) method for estimation in high dimensional sparse regression models with group structure. The new estimator minimizes the square root of the residual sum of squares plus a penalty…

Statistics Theory · Mathematics 2013-08-01 Florentina Bunea , Johannes Lederer , Yiyuan She

A Unified Parallel Algorithm for Regularized Group PLS Scalable to Big Data

Partial Least Squares (PLS) methods have been heavily exploited to analyse the association between two blocs of data. These powerful approaches can be applied to data sets where the number of variables is greater than the number of…

Machine Learning · Statistics 2017-02-24 Pierre Lafaye de Micheaux , Benoit Liquet , Matthew Sutton

Scalable Partial Least Squares Regression on Grammar-Compressed Data Matrices

With massive high-dimensional data now commonplace in research and industry, there is a strong and growing demand for more scalable computational techniques for data analysis and knowledge discovery. Key to turning these data into knowledge…

Data Structures and Algorithms · Computer Science 2016-06-17 Yasuo Tabei , Hiroto Saigo , Yoshihiro Yamanishi , Simon J. Puglisi

A Bayesian Lasso based Sparse Learning Model

The Bayesian Lasso is constructed in the linear regression framework and applies the Gibbs sampling to estimate the regression parameters. This paper develops a new sparse learning model, named the Bayesian Lasso Sparse (BLS) model, that…

Machine Learning · Statistics 2022-07-15 Ingvild M. Helgøy , Yushu Li

Principal Balances of Compositional Data for Regression and Classification using Partial Least Squares

High-dimensional compositional data are commonplace in the modern omics sciences amongst others. Analysis of compositional data requires a proper choice of orthonormal coordinate representation as their relative nature is not compatible…

Methodology · Statistics 2022-11-04 V. Nesrstová , I. Wilms , J. Palarea-Albaladejo , P. Filzmoser , J. A. Martín-Fernández , D. Friedecký , K. Hron

Bayesian data-driven discovery of partial differential equations with variable coefficients

The discovery of Partial Differential Equations (PDEs) is an essential task for applied science and engineering. However, data-driven discovery of PDEs is generally challenging, primarily stemming from the sensitivity of the discovered…

Machine Learning · Statistics 2024-03-27 Aoxue Chen , Yifan Du , Liyao Mars Gao , Guang Lin

Variable Selection with Scalable Bootstrap in Generalized Linear Model for Massive Data

Bootstrap is commonly used as a tool for non-parametric statistical inference to estimate meaningful parameters in Variable Selection Models. However, for massive dataset that has exponential growth rate, the computation of Bootstrap…

Computation · Statistics 2016-12-26 Zhibing He , Yichen Qin , Ben-Chang Shia , Yang Li

Robust and Sparse Regression in GLM by Stochastic Optimization

The generalized linear model (GLM) plays a key role in regression analyses. In high-dimensional data, the sparse GLM has been used but it is not robust against outliers. Recently, the robust methods have been proposed for the specific…

Machine Learning · Statistics 2026-05-15 Takayuki Kawashima , Hironori Fujisawa

SIGLE: a valid procedure for Selective Inference with the Generalized Linear Lasso

This article investigates uncertainty quantification of the generalized linear lasso~(GLL), a popular variable selection method in high-dimensional regression settings. In many fields of study, researchers use data-driven methods to select…

Statistics Theory · Mathematics 2023-07-11 Quentin Duchemin , Yohann de Castro

Partial sequence labeling with structured Gaussian Processes

Existing partial sequence labeling models mainly focus on max-margin framework which fails to provide an uncertainty estimation of the prediction. Further, the unique ground truth disambiguation strategy employed by these models may include…

Machine Learning · Computer Science 2022-09-21 Xiaolei Lu , Tommy W. S. Chow