English
Related papers

Related papers: New developments in Sparse PLS regression

200 papers

We develop a new robust stopping criterion in Partial Least Squares Regressions (PLSR) components construction characterised by a high level of stability. This new criterion is defined as a universal one since it is suitable both for PLSR…

Methodology · Statistics 2021-08-17 Jérémy Magnanensi , Frédéric Bertrand , Myriam Maumy-Bertrand , Nicolas Meyer

Latent structure methods, specifically linear continuous latent structure methods, are a type of fundamental statistical learning strategy. They are widely used for dimension reduction, regression and prediction, in the fields of…

Methodology · Statistics 2025-08-07 Clara Grazian , Qian Jin , Pierre Lafaye De Micheaux

Motivation: The high dimensionality of genomic data calls for the development of specific classification methodologies, especially to prevent over-optimistic predictions. This challenge can be tackled by compression and variable selection,…

Methodology · Statistics 2021-04-10 G. Durif , L. Modolo , J. Michaelsson , J. E. Mold , S. Lambert-Lacroix , F. Picard

Partial least squares (PLS) regression combines dimensionality reduction and prediction using a latent variable model. Since partial least squares regression (PLS-R) does not require matrix inversion or diagonalization, it can be applied to…

Methodology · Statistics 2014-08-05 Tzu-Yu Liu , Laura Trinchera , Arthur Tenenhaus , Dennis Wei , Alfred O. Hero

Relating a set of variables X to a response y is crucial in chemometrics. A quantitative prediction objective can be enriched by qualitative data interpretation, for instance by locating the most influential features. When high-dimensional…

Machine Learning · Statistics 2023-04-21 Louna Alsouki , Laurent Duval , Clément Marteau , Rami El Haddad , François Wahl

This paper investigates some theoretical properties of the Partial Least Square (PLS) method. We focus our attention on the single component case, that provides a useful framework to understand the underlying mechanism. We provide a…

Statistics Theory · Mathematics 2023-10-17 Luca Castelli , Clément Marteau , Irène Gannaz

Partial least squares, as a dimension reduction method, has become increasingly important for its ability to deal with problems with a large number of variables. Since noisy variables may weaken the performance of the model, the sparse…

Methodology · Statistics 2020-06-08 Weijuan Liang , Shuangge Ma , Qingzhao Zhang , Tingyu Zhu

Partial Least Square (PLS) is a dimension reduction method used to remove multicollinearities in a regression model. However contrary to Principal Components Analysis (PCA) the PLS components are also choosen to be optimal for predicting…

Statistics Theory · Mathematics 2014-05-26 Mélanie Blazère , Fabrice Gamboa , Jean-Michel Loubes

This paper presents a new variable selection approach integrated with Gaussian process (GP) regression. We consider a sparse projection of input variables and a general stationary covariance model that depends on the Euclidean distance…

Machine Learning · Computer Science 2020-08-26 Chiwoo Park , David J. Borth , Nicholas S. Wilson , Chad N. Hunter

Gaussian processes (GPs) have gained popularity as flexible machine learning models for regression and function approximation with an in-built method for uncertainty quantification. However, GPs suffer when the amount of training data is…

Machine Learning · Statistics 2025-11-26 Jonas Latz , Aretha L. Teckentrup , Simon Urbainczyk

We introduce and study the Group Square-Root Lasso (GSRL) method for estimation in high dimensional sparse regression models with group structure. The new estimator minimizes the square root of the residual sum of squares plus a penalty…

Statistics Theory · Mathematics 2013-08-01 Florentina Bunea , Johannes Lederer , Yiyuan She

Partial Least Squares (PLS) methods have been heavily exploited to analyse the association between two blocs of data. These powerful approaches can be applied to data sets where the number of variables is greater than the number of…

Machine Learning · Statistics 2017-02-24 Pierre Lafaye de Micheaux , Benoit Liquet , Matthew Sutton

With massive high-dimensional data now commonplace in research and industry, there is a strong and growing demand for more scalable computational techniques for data analysis and knowledge discovery. Key to turning these data into knowledge…

Data Structures and Algorithms · Computer Science 2016-06-17 Yasuo Tabei , Hiroto Saigo , Yoshihiro Yamanishi , Simon J. Puglisi

The Bayesian Lasso is constructed in the linear regression framework and applies the Gibbs sampling to estimate the regression parameters. This paper develops a new sparse learning model, named the Bayesian Lasso Sparse (BLS) model, that…

Machine Learning · Statistics 2022-07-15 Ingvild M. Helgøy , Yushu Li

High-dimensional compositional data are commonplace in the modern omics sciences amongst others. Analysis of compositional data requires a proper choice of orthonormal coordinate representation as their relative nature is not compatible…

The discovery of Partial Differential Equations (PDEs) is an essential task for applied science and engineering. However, data-driven discovery of PDEs is generally challenging, primarily stemming from the sensitivity of the discovered…

Machine Learning · Statistics 2024-03-27 Aoxue Chen , Yifan Du , Liyao Mars Gao , Guang Lin

Bootstrap is commonly used as a tool for non-parametric statistical inference to estimate meaningful parameters in Variable Selection Models. However, for massive dataset that has exponential growth rate, the computation of Bootstrap…

Computation · Statistics 2016-12-26 Zhibing He , Yichen Qin , Ben-Chang Shia , Yang Li

The generalized linear model (GLM) plays a key role in regression analyses. In high-dimensional data, the sparse GLM has been used but it is not robust against outliers. Recently, the robust methods have been proposed for the specific…

Machine Learning · Statistics 2026-05-15 Takayuki Kawashima , Hironori Fujisawa

This article investigates uncertainty quantification of the generalized linear lasso~(GLL), a popular variable selection method in high-dimensional regression settings. In many fields of study, researchers use data-driven methods to select…

Statistics Theory · Mathematics 2023-07-11 Quentin Duchemin , Yohann de Castro

Existing partial sequence labeling models mainly focus on max-margin framework which fails to provide an uncertainty estimation of the prediction. Further, the unique ground truth disambiguation strategy employed by these models may include…

Machine Learning · Computer Science 2022-09-21 Xiaolei Lu , Tommy W. S. Chow
‹ Prev 1 2 3 10 Next ›