Related papers: Supervised Learning for Multi-Block Incomplete Dat…

High-Dimensional Partial Least Squares: Spectral Analysis and Fundamental Limitations

Partial Least Squares (PLS) is a widely used method for data integration, designed to extract latent components shared across paired high-dimensional datasets. Despite decades of practical success, a precise theoretical understanding of its…

Machine Learning · Statistics 2025-12-18 Victor Léger , Florent Chatelain

Weighted Sparse Partial Least Squares for Joint Sample and Feature Selection

Sparse Partial Least Squares (sPLS) is a common dimensionality reduction technique for data fusion, which projects data samples from two views by seeking linear combinations with a small number of variables with the maximum variance.…

Machine Learning · Computer Science 2023-08-15 Wenwen Min , Taosheng Xu , Chris Ding

Integrative Sparse Partial Least Squares

Partial least squares, as a dimension reduction method, has become increasingly important for its ability to deal with problems with a large number of variables. Since noisy variables may weaken the performance of the model, the sparse…

Methodology · Statistics 2020-06-08 Weijuan Liang , Shuangge Ma , Qingzhao Zhang , Tingyu Zhu

A Unified Parallel Algorithm for Regularized Group PLS Scalable to Big Data

Partial Least Squares (PLS) methods have been heavily exploited to analyse the association between two blocs of data. These powerful approaches can be applied to data sets where the number of variables is greater than the number of…

Machine Learning · Statistics 2017-02-24 Pierre Lafaye de Micheaux , Benoit Liquet , Matthew Sutton

Missing-Data-Induced Phase Transitions in Spectral PLS for Multimodal Learning

Partial Least Squares (PLS) learns shared structure from paired data via the top singular vectors of the empirical cross-covariance (PLS-SVD), but multimodal datasets often have missing entries in both views. We study PLS-SVD under…

Machine Learning · Computer Science 2026-05-18 Anders Gjølbye , Ida Kargaard , Emma Kargaard , Lina Skerath , Lars Kai Hansen

Deep Learning Partial Least Squares

High dimensional data reduction techniques are provided by using partial least squares within deep learning. Our framework provides a nonlinear extension of PLS together with a disciplined approach to feature selection and architecture…

Methodology · Statistics 2021-06-29 Nicholas Polson , Vadim Sokolov , Jianeng Xu

High Dimensional Classification with combined Adaptive Sparse PLS and Logistic Regression

Motivation: The high dimensionality of genomic data calls for the development of specific classification methodologies, especially to prevent over-optimistic predictions. This challenge can be tackled by compression and variable selection,…

Methodology · Statistics 2021-04-10 G. Durif , L. Modolo , J. Michaelsson , J. E. Mold , S. Lambert-Lacroix , F. Picard

Partial least squares discriminant analysis: A dimensionality reduction method to classify hyperspectral data

The recent development of more sophisticated spectroscopic methods allows acqui- sition of high dimensional datasets from which valuable information may be extracted using multivariate statistical analyses, such as dimensionality reduction…

Applications · Statistics 2023-11-14 Mario Fordellone , Andrea Bellincontro , Fabio Mencarelli

Dual-sPLS: a family of Dual Sparse Partial Least Squares regressions for feature selection and prediction with tunable sparsity; evaluation on simulated and near-infrared (NIR) data

Relating a set of variables X to a response y is crucial in chemometrics. A quantitative prediction objective can be enriched by qualitative data interpretation, for instance by locating the most influential features. When high-dimensional…

Machine Learning · Statistics 2023-04-21 Louna Alsouki , Laurent Duval , Clément Marteau , Rami El Haddad , François Wahl

In all LikelihoodS: How to Reliably Select Pseudo-Labeled Data for Self-Training in Semi-Supervised Learning

Self-training is a simple yet effective method within semi-supervised learning. The idea is to iteratively enhance training data by adding pseudo-labeled data. Its generalization performance heavily depends on the selection of these…

Machine Learning · Statistics 2023-03-03 Julian Rodemann , Christoph Jansen , Georg Schollmeyer , Thomas Augustin

Unsupervised Data Imputation via Variational Inference of Deep Subspaces

A wide range of systems exhibit high dimensional incomplete data. Accurate estimation of the missing data is often desired, and is crucial for many downstream analyses. Many state-of-the-art recovery methods involve supervised learning…

Computer Vision and Pattern Recognition · Computer Science 2019-03-15 Adrian V. Dalca , John Guttag , Mert R. Sabuncu

Learning sparse gradients for variable selection and dimension reduction

Variable selection and dimension reduction are two commonly adopted approaches for high-dimensional data analysis, but have traditionally been treated separately. Here we propose an integrated approach, called sparse gradient learning…

Machine Learning · Statistics 2010-07-02 Gui-Bo Ye , Xiaohui Xie

Enhancing Semi-Supervised Learning via Representative and Diverse Sample Selection

Semi-Supervised Learning (SSL) has become a preferred paradigm in many deep learning tasks, which reduces the need for human labor. Previous studies primarily focus on effectively utilising the labelled and unlabeled data to improve…

Machine Learning · Computer Science 2024-10-29 Qian Shao , Jiangrui Kang , Qiyuan Chen , Zepeng Li , Hongxia Xu , Yiwen Cao , Jiajuan Liang , Jian Wu

Random Matrix Analysis to Balance between Supervised and Unsupervised Learning under the Low Density Separation Assumption

We propose a theoretical framework to analyze semi-supervised classification under the low density separation assumption in a high-dimensional regime. In particular, we introduce QLDS, a linear classification model, where the low density…

Machine Learning · Computer Science 2023-10-23 Vasilii Feofanov , Malik Tiomoko , Aladin Virmaux

Regularized Partial Least Squares with an Application to NMR Spectroscopy

High-dimensional data common in genomics, proteomics, and chemometrics often contains complicated correlation structures. Recently, partial least squares (PLS) and Sparse PLS methods have gained attention in these areas as dimension…

Machine Learning · Statistics 2012-04-19 Genevera I. Allen , Christine Peterson , Marina Vannucci , Mirjana Maletic-Savatic

Semi-Supervised Learning in the Few-Shot Zero-Shot Scenario

Semi-Supervised Learning (SSL) is a framework that utilizes both labeled and unlabeled data to enhance model performance. Conventional SSL methods operate under the assumption that labeled and unlabeled data share the same label space.…

Computer Vision and Pattern Recognition · Computer Science 2023-11-16 Noam Fluss , Guy Hacohen , Daphna Weinshall

Optimistic Semi-supervised Least Squares Classification

The goal of semi-supervised learning is to improve supervised classifiers by using additional unlabeled training examples. In this work we study a simple self-learning approach to semi-supervised learning applied to the least squares…

Machine Learning · Statistics 2016-10-13 Jesse H. Krijthe , Marco Loog

Multiple imputation using dimension reduction techniques for high-dimensional data

Missing data present challenges in data analysis. Naive analyses such as complete-case and available-case analysis may introduce bias and loss of efficiency, and produce unreliable results. Multiple imputation (MI) is one of the most widely…

Methodology · Statistics 2019-05-15 Domonique W. Hodge , Sandra E. Safo , Qi Long

Jointly Sparse Global SIMPLS Regression

Partial least squares (PLS) regression combines dimensionality reduction and prediction using a latent variable model. Since partial least squares regression (PLS-R) does not require matrix inversion or diagonalization, it can be applied to…

Methodology · Statistics 2014-08-05 Tzu-Yu Liu , Laura Trinchera , Arthur Tenenhaus , Dennis Wei , Alfred O. Hero

Block Sparse Bayesian Learning: A Diversified Scheme

This paper introduces a novel prior called Diversified Block Sparse Prior to characterize the widespread block sparsity phenomenon in real-world data. By allowing diversification on intra-block variance and inter-block correlation matrices,…

Machine Learning · Computer Science 2024-10-31 Yanhao Zhang , Zhihan Zhu , Yong Xia