Related papers: Sparse Probit Linear Mixed Model

Polygenic Modeling with Bayesian Sparse Linear Mixed Models

Both linear mixed models (LMMs) and sparse regression models are widely used in genetics applications, including, recently, polygenic modeling in genome-wide association studies. These two approaches make very different assumptions, so are…

Quantitative Methods · Quantitative Biology 2012-11-16 Xiang Zhou , Peter Carbonetto , Matthew Stephens

Scalable Subset Selection in Linear Mixed Models

Linear mixed models (LMMs), which incorporate fixed and random effects, are key tools for analyzing heterogeneous data, such as in personalized medicine. Nowadays, this type of data is increasingly wide, sometimes containing thousands of…

Machine Learning · Statistics 2026-05-15 Ryan Thompson , Matt P. Wand , Joanna J. J. Wang

Methods of Selective Inference for Linear Mixed Models: a Review and Empirical Comparison

Selective inference aims at providing valid inference after a data-driven selection of models or hypotheses. It is essential to avoid overconfident results and replicability issues. While significant advances have been made in this area for…

Methodology · Statistics 2025-03-14 Matteo D'Alessandro , Magne Thoresen

Efficient Penalized Generalized Linear Mixed Models for Variable Selection and Genetic Risk Prediction in High-Dimensional Data

Sparse regularized regression methods are now widely used in genome-wide association studies (GWAS) to address the multiple testing burden that limits discovery of potentially important predictors. Linear mixed models (LMMs) have become an…

Methodology · Statistics 2022-06-27 Julien St-Pierre , Karim Oualkacha , Sahir Rai Bhatnagar

A Sparse Graph-Structured Lasso Mixed Model for Genetic Association with Confounding Correction

While linear mixed model (LMM) has shown a competitive performance in correcting spurious associations raised by population stratification, family structures, and cryptic relatedness, more challenges are still to be addressed regarding the…

Machine Learning · Computer Science 2023-02-15 Wenting Ye , Xiang Liu , Tianwei Yue , Wenping Wang

Sparse Linear Mixed Model Selection via Streamlined Variational Bayes

Linear mixed models are a versatile statistical tool to study data by accounting for fixed effects and random effects from multiple sources of variability. In many situations, a large number of candidate fixed effects is available and it is…

Methodology · Statistics 2022-09-09 Emanuele Degani , Luca Maestrini , Dorota Toczydłowska , Matt P. Wand

Genetic Analysis of Transformed Phenotypes

Linear mixed models (LMMs) are a powerful and established tool for studying genotype-phenotype relationships. A limiting assumption of LMMs is that the residuals are Gaussian distributed, a requirement that rarely holds in practice.…

Genomics · Quantitative Biology 2014-08-10 Nicolo Fusi , Christoph Lippert , Neil D. Lawrence , Oliver Stegle

Sparse Linear Isotonic Models

In machine learning and data mining, linear models have been widely used to model the response as parametric linear functions of the predictors. To relax such stringent assumptions made by parametric linear models, additive models consider…

Machine Learning · Statistics 2017-10-18 Sheng Chen , Arindam Banerjee

Symbolic Formulae for Linear Mixed Models

A statistical model is a mathematical representation of an often simplified or idealised data-generating process. In this paper, we focus on a particular type of statistical model, called linear mixed models (LMMs), that is widely used in…

Methodology · Statistics 2020-01-23 Emi Tanaka , Francis K. C. Hui

Learning sparse generalized linear models with binary outcomes via iterative hard thresholding

In statistics, generalized linear models (GLMs) are widely used for modeling data and can expressively capture potential nonlinear dependence of the model's outcomes on its covariates. Within the broad family of GLMs, those with binary…

Statistics Theory · Mathematics 2025-09-04 Namiko Matsumoto , Arya Mazumdar

Sparse Partially Linear Additive Models

The generalized partially linear additive model (GPLAM) is a flexible and interpretable approach to building predictive models. It combines features in an additive manner, allowing each to have either a linear or nonlinear effect on the…

Methodology · Statistics 2018-03-29 Yin Lou , Jacob Bien , Rich Caruana , Johannes Gehrke

Sparse Linear Identifiable Multivariate Modeling

In this paper we consider sparse and identifiable linear latent variable (factor) and linear Bayesian network models for parsimonious analysis of multivariate data. We propose a computationally efficient method for joint parameter and model…

Machine Learning · Statistics 2011-06-24 Ricardo Henao , Ole Winther

Sparse high-dimensional linear mixed modeling with a partitioned empirical Bayes ECM algorithm

High-dimensional longitudinal data is increasingly used in a wide range of scientific studies. To properly account for dependence between longitudinal observations, statistical methods for high-dimensional linear mixed models (LMMs) have…

Methodology · Statistics 2024-07-10 Anja Zgodic , Ray Bai , Jiajia Zhang , Peter Olejua , Alexander C. McLain

Sparse Conditional Hidden Markov Model for Weakly Supervised Named Entity Recognition

Weakly supervised named entity recognition methods train label models to aggregate the token annotations of multiple noisy labeling functions (LFs) without seeing any manually annotated labels. To work well, the label model needs to…

Computation and Language · Computer Science 2022-06-08 Yinghao Li , Le Song , Chao Zhang

A near-exact linear mixed model for genome-wide association studies

Linear mixed models (LMM) are widely adopted in genome-wide association studies (GWAS) to account for population stratification and cryptic relatedness. However, the parameter estimation of LMMs imposes substantial computational burdens due…

Computation · Statistics 2025-08-08 Zhibin Pu , Shufei Ge , Shijia Wang

Supersparse Linear Integer Models for Predictive Scoring Systems

We introduce Supersparse Linear Integer Models (SLIM) as a tool to create scoring systems for binary classification. We derive theoretical bounds on the true risk of SLIM scoring systems, and present experimental results to show that SLIM…

Machine Learning · Statistics 2013-06-26 Berk Ustun , Stefano Traca , Cynthia Rudin

Linear Mixed Models with Marginally Symmetric Nonparametric Random Effects

Linear mixed models (LMMs) are used as an important tool in the data analysis of repeated measures and longitudinal studies. The most common form of LMMs utilize a normal distribution to model the random effects. Such assumptions can often…

Methodology · Statistics 2016-02-16 Hien D. Nguyen , Geoffrey J. McLachlan

SLM: End-to-end Feature Selection via Sparse Learnable Masks

Feature selection has been widely used to alleviate compute requirements during training, elucidate model interpretability, and improve model generalizability. We propose SLM -- Sparse Learnable Masks -- a canonical approach for end-to-end…

Machine Learning · Computer Science 2023-04-07 Yihe Dong , Sercan O. Arik

Scalable Algorithms for Learning High-Dimensional Linear Mixed Models

Linear mixed models (LMMs) are used extensively to model dependecies of observations in linear regression and are used extensively in many application areas. Parameter estimation for LMMs can be computationally prohibitive on big data.…

Machine Learning · Statistics 2019-03-08 Zilong Tan , Kimberly Roche , Xiang Zhou , Sayan Mukherjee

Marginally specified models for analyzing multivariate longitudinal binary data

Marginally specified models have recently become a popular tool for discrete longitudinal data analysis. Nonetheless, they introduce complex constraint equations and model fitting algorithms. Moreover, there is a lack of available software…

Methodology · Statistics 2014-05-15 Ozgur Asar , Ozlem Ilk