Related papers: Subset selection for linear mixed models

Methods of Selective Inference for Linear Mixed Models: a Review and Empirical Comparison

Selective inference aims at providing valid inference after a data-driven selection of models or hypotheses. It is essential to avoid overconfident results and replicability issues. While significant advances have been made in this area for…

Methodology · Statistics 2025-03-14 Matteo D'Alessandro, Magne Thoresen

Scalable Subset Selection in Linear Mixed Models

Linear mixed models (LMMs), which incorporate fixed and random effects, are key tools for analyzing heterogeneous data, such as in personalized medicine. Nowadays, this type of data is increasingly wide, sometimes containing thousands of…

Machine Learning · Statistics 2026-05-15 Ryan Thompson, Matt P. Wand, Joanna J. J. Wang

Model Selection in Linear Mixed Models

Linear mixed effects models are highly flexible in handling a broad range of data types and are therefore widely used in applications. A key part in the analysis of data is model selection, which often aims to choose a parsimonious model…

Methodology · Statistics 2013-06-12 Samuel Müller, J. L. Scealy, A. H. Welsh

Combined shrinkage of fixed and random effects in linear mixed models using empirical Bayes

A novel data-driven methodology is presented for the joint selection of prior parameters for both fixed and random effects in Linear Mixed Models (LMMs). This approach facilitates the estimation of complex random-effects structures, as well…

Methodology · Statistics 2026-04-28 Matteo Amestoy, R. Vermeulen, Mark A. van de Wiel, Wessel N. van Wieringen

Bayesian subset selection and variable importance for interpretable prediction and classification

Subset selection is a valuable tool for interpretable learning, scientific discovery, and data compression. However, classical subset selection is often avoided due to selection instability, lack of regularization, and difficulties with…

Machine Learning · Statistics 2022-02-17 Daniel R. Kowal

Bayesian high-dimensional covariate selection in non-linear mixed-effects models using the SAEM algorithm

High-dimensional variable selection, with many more covariates than observations, is widely documented in standard regression models, but there are still few tools to address it in non-linear mixed-effects models where data are collected…

Statistics Theory · Mathematics 2024-04-08 Marion Naveau, Guillaume Kon Kam King, Renaud Rincent, Laure Sansonnet, Maud Delattre

Bayesian Variable Selection in Distributed Lag Models: A Focus on Binary Quantile and Count Data Regressions

Distributed Lag Models (DLMs) and similar regression approaches such as MIDAS have been used for many decades in econometrics and more recently to investigate how poor air quality adversely affects human health. In this paper we describe…

Methodology · Statistics 2025-01-30 Daniel Dempsey, Jason Wyse

Genetic Analysis of Transformed Phenotypes

Linear mixed models (LMMs) are a powerful and established tool for studying genotype-phenotype relationships. A limiting assumption of LMMs is that the residuals are Gaussian distributed, a requirement that rarely holds in practice.…

Genomics · Quantitative Biology 2014-08-10 Nicolo Fusi, Christoph Lippert, Neil D. Lawrence, Oliver Stegle

Bayesian Adaptive Lasso

We propose the Bayesian adaptive Lasso (BaLasso) for variable selection and coefficient estimation in linear regression. The BaLasso is adaptive to the signal level by adopting different shrinkage for different coefficients. Furthermore, we…

Methodology · Statistics 2010-09-14 Chenlei Leng, Minh Ngoc Tran, David Nott

Bayesian variable selection for latent class analysis using a collapsed Gibbs sampler

Latent class analysis is used to perform model based clustering for multivariate categorical responses. Selection of the variables most relevant for clustering is an important task which can affect the quality of clustering considerably.…

Computation · Statistics 2016-06-17 Arthur White, Jason Wyse, Thomas Brendan Murphy

Subset Selection for Multiple Linear Regression via Optimization

Subset selection in multiple linear regression aims to choose a subset of candidate explanatory variables that trades off fitting error (explanatory power) against model complexity (number of variables selected). We build mathematical programming…

Machine Learning · Statistics 2020-09-04 Young Woong Park, Diego Klabjan

Sparse Probit Linear Mixed Model

Linear Mixed Models (LMMs) are important tools in statistical genetics. When used for feature selection, they make it possible to find a sparse set of genetic traits that best predict a continuous phenotype of interest, while simultaneously correcting…

Machine Learning · Statistics 2017-09-12 Stephan Mandt, Florian Wenzel, Shinichi Nakajima, John P. Cunningham, Christoph Lippert, Marius Kloft

Scalable Algorithms for Learning High-Dimensional Linear Mixed Models

Linear mixed models (LMMs) are used extensively to model dependencies among observations in linear regression and appear in many application areas. Parameter estimation for LMMs can be computationally prohibitive on big data.…

Machine Learning · Statistics 2019-03-08 Zilong Tan, Kimberly Roche, Xiang Zhou, Sayan Mukherjee

Ultra-efficient MCMC for Bayesian longitudinal functional data analysis

Functional mixed models are widely useful for regression analysis with dependent functional data, including longitudinal functional data with scalar predictors. However, existing algorithms for Bayesian inference with these models only…

Methodology · Statistics 2023-06-14 Thomas Y. Sun, Daniel R. Kowal

Bayesian Variable Selection in a Million Dimensions

Bayesian variable selection is a powerful tool for data analysis, as it offers a principled method for variable selection that accounts for prior information and uncertainty. However, wider adoption of Bayesian variable selection has been…

Methodology · Statistics 2023-12-06 Martin Jankowiak

Assessing an Alternative for 'Negative Variance Components': A Gentle Introduction to Bayesian Covariance Structure Modelling for Negative Associations Among Patients with Personalized Treatments

The multilevel model (MLM) is the popular approach to describe dependences of hierarchically clustered observations. A main feature is the capability to estimate (cluster-specific) random effect parameters, while their distribution…

Methodology · Statistics 2021-06-21 Jean-Paul Fox, Wouter Smink

Flexible Bayesian Nonlinear Model Configuration

Regression models are used in a wide range of applications providing a powerful scientific tool for researchers from different fields. Linear, or simple parametric, models are often not sufficient to describe complex relationships between…

Machine Learning · Statistics 2021-11-24 Aliaksandr Hubin, Geir Storvik, Florian Frommlet

Bayesian model selection in additive partial linear models via locally adaptive splines

We provide a flexible framework for selecting among a class of additive partial linear models that allows both linear and nonlinear additive components. In practice, it is challenging to determine which additive components should be…

Methodology · Statistics 2021-09-20 Seonghyun Jeong, Taeyoung Park, David A. van Dyk

Robust Linear Mixed Models using Hierarchical Gamma-Divergence

Linear mixed models (LMMs) are a popular class of methods for analyzing longitudinal and clustered data. However, such models can be sensitive to outliers, and this can lead to biased inference on model parameters and inaccurate prediction…

Methodology · Statistics 2025-03-28 Shonosuke Sugasawa, Francis K. C. Hui, Alan H. Welsh

Bayesian Models for Joint Selection of Features and Auto-Regressive Lags: Theory and Applications in Environmental and Financial Forecasting

We develop a Bayesian framework for variable selection in linear regression with autocorrelated errors, accommodating lagged covariates and autoregressive structures. This setting occurs in time series applications where responses depend on…

Methodology · Statistics 2025-08-18 Alokesh Manna, Sujit K. Ghosh