Related papers: Sparse matrix linear models for structured high-th…

A Sparse Linear Model for Positive Definite Estimation of Covariance Matrices

Sparse covariance matrices play crucial roles by encoding the interdependencies between variables in numerous fields such as genetics and neuroscience. Despite substantial studies on sparse covariance matrices, existing methods face several…

Methodology · Statistics 2026-03-03 Rakheon Kim , Irina Gaynanova

Model-based Clustering with Sparse Covariance Matrices

Finite Gaussian mixture models are widely used for model-based clustering of continuous data. Nevertheless, since the number of model parameters scales quadratically with the number of variables, these models can be easily…

Methodology · Statistics 2018-09-25 Michael Fop , Thomas Brendan Murphy , Luca Scrucca

Sparse Multivariate Linear Regression with Strongly Associated Response Variables

We propose new methods for multivariate linear regression when the regression coefficient matrix is sparse and the error covariance matrix is dense. We assume that the error covariance matrix has equicorrelation across the response…

Methodology · Statistics 2025-08-13 Daeyoung Ham , Bradley S. Price , Adam J. Rothman

Fast and efficient algorithms for sparse semiparametric bi-functional regression

A new sparse semiparametric model is proposed, which incorporates the influence of two functional random variables in a scalar response in a flexible and interpretable manner. One of the functional covariates is included through a…

Methodology · Statistics 2024-01-29 Silvia Novo , Philippe Vieu , Germán Aneiros

Regularized Matrix Regression

Modern technologies are producing a wealth of data with complex structures. For instance, in two-dimensional digital imaging, flow cytometry, and electroencephalography, matrix type covariates frequently arise when measurements are obtained…

Methodology · Statistics 2013-10-22 Hua Zhou , Lexin Li

A Survey of Estimation Methods for Sparse High-dimensional Time Series Models

High-dimensional time series datasets are becoming increasingly common in many areas of biological and social sciences. Some important applications include gene regulatory network reconstruction using time course gene expression data, brain…

Methodology · Statistics 2021-08-02 Sumanta Basu , David S. Matteson

A Simple Correction Procedure for High-Dimensional Generalized Linear Models with Measurement Error

We consider high-dimensional generalized linear models when the covariates are contaminated by measurement error. Estimates from errors-in-variables regression models are well-known to be biased in traditional low-dimensional settings if…

Computation · Statistics 2020-01-06 Michael Byrd , Monnie McGee

Sparse Group Selection Through Co-Adaptive Penalties

Recent work has focused on the problem of conducting linear regression when the number of covariates is very large, potentially greater than the sample size. To facilitate this, one useful tool is to assume that the model can be well…

Methodology · Statistics 2011-11-21 Zhou Fang

Fast Dual-Regularized Autoencoder for Sparse Biological Data

Relationship inference from sparse data is an important task with applications ranging from product recommendation to drug discovery. A recently proposed linear model for sparse matrix completion has demonstrated surprising advantage in…

Machine Learning · Computer Science 2024-03-14 Aleksandar Poleksic

Sparse Linear Regression With Missing Data

This paper proposes a fast and accurate method for sparse regression in the presence of missing data. The underlying statistical model encapsulates the low-dimensional structure of the incomplete data matrix and the sparsity of the…

Machine Learning · Statistics 2015-03-31 Ravi Ganti , Rebecca M. Willett

Fast Sparse Classification for Generalized Linear and Additive Models

We present fast classification techniques for sparse generalized linear and additive models. These techniques can handle thousands of features and thousands of observations in minutes, even in the presence of many highly correlated…

Machine Learning · Computer Science 2022-11-01 Jiachang Liu , Chudi Zhong , Margo Seltzer , Cynthia Rudin

Missing Data in Sparse Transition Matrix Estimation for Sub-Gaussian Vector Autoregressive Processes

High-dimensional time series data exist in numerous areas such as finance, genomics, healthcare, and neuroscience. An unavoidable aspect of all such datasets is missing data, and dealing with this issue has been an important focus in…

Machine Learning · Statistics 2018-02-27 Amin Jalali , Rebecca Willett

Sparser, Faster, Lighter Transformer Language Models

Scaling autoregressive large language models (LLMs) has driven unprecedented progress but comes with vast computational costs. In this work, we tackle these costs by leveraging unstructured sparsity within an LLM's feedforward layers, the…

Machine Learning · Computer Science 2026-05-11 Edoardo Cetin , Stefano Peluchetti , Emilio Castillo , Akira Naruse , Mana Murakami , Llion Jones

High-Dimensional Distributed Sparse Classification with Scalable Communication-Efficient Global Updates

As the size of datasets used in statistical learning continues to grow, distributed training of models has attracted increasing attention. These methods partition the data and exploit parallelism to reduce memory and runtime, but suffer…

Machine Learning · Computer Science 2024-07-10 Fred Lu , Ryan R. Curtin , Edward Raff , Francis Ferraro , James Holt

Network-regularized Sparse Logistic Regression Models for Clinical Risk Prediction and Biomarker Discovery

Molecular profiling data (e.g., gene expression) has been used for clinical risk prediction and biomarker discovery. However, it is necessary to integrate other prior knowledge like biological pathways or gene interaction networks to…

Genomics · Quantitative Biology 2016-09-22 Wenwen Min , Juan Liu , Shihua Zhang

For data with high-dimensional covariates but small to moderate sample sizes, the analysis of single datasets often generates unsatisfactory results. The integrative analysis of multiple independent datasets provides an effective way of…

Methodology · Statistics 2015-01-19 Yuan Huang , Qingzhao Zhang , Sanguo Zhang , Jian Huang , Shuangge Ma

Efficient Penalized Generalized Linear Mixed Models for Variable Selection and Genetic Risk Prediction in High-Dimensional Data

Sparse regularized regression methods are now widely used in genome-wide association studies (GWAS) to address the multiple testing burden that limits discovery of potentially important predictors. Linear mixed models (LMMs) have become an…

Methodology · Statistics 2022-06-27 Julien St-Pierre , Karim Oualkacha , Sahir Rai Bhatnagar

Accelerated structured matrix factorization

Matrix factorization exploits the idea that, in complex high-dimensional data, the actual signal typically lies in lower-dimensional structures. These lower dimensional objects provide useful insight, with interpretability favored by sparse…

Methodology · Statistics 2022-12-14 Lorenzo Schiavon , Bernardo Nipoti , Antonio Canale

Efficient Computation for Centered Linear Regression with Sparse Inputs

Regression with sparse inputs is a common theme for large scale models. Optimizing the underlying linear algebra for sparse inputs allows such models to be estimated faster. At the same time, centering the inputs has benefits in improving…

Computation · Statistics 2019-10-30 Jeffrey Wong

Joint estimation of sparse multivariate regression and conditional graphical models

Multivariate regression model is a natural generalization of the classical univari- ate regression model for fitting multiple responses. In this paper, we propose a high- dimensional multivariate conditional regression model for…

Machine Learning · Statistics 2016-11-26 Junhui Wang