Related papers: Significance Analysis for Pairwise Variable Select…

An iterative algorithm for joint covariate and random effect selection in mixed effects models

We consider joint selection of fixed and random effects in general mixed-effects models. The interpretation of estimated mixed-effects models is challenging since changing the structure of one set of effects can lead to different choices of…

Methodology · Statistics 2020-02-26 Maud Delattre , Marie-Anne Poursat

A Bayesian joint model of multiple longitudinal and categorical outcomes with application to multiple myeloma using permutation-based variable importance

Joint models have proven to be an effective approach for uncovering potentially hidden connections between various types of outcomes, mainly continuous, time-to-event, and binary. Typically, longitudinal continuous outcomes are…

Methodology · Statistics 2025-06-17 Danilo Alvares , Jessica K. Barrett , François Mercier , Jochen Schulze , Sean Yiu , Felipe Castro , Spyros Roumpanis , Yajing Zhu

Factorizable Joint Shift in Multinomial Classification

Factorizable joint shift (FJS) was recently proposed as a type of dataset shift for which the complete characteristics can be estimated from feature data observations on the test dataset by a method called Joint Importance Aligning. For the…

Machine Learning · Statistics 2022-09-19 Dirk Tasche

Significance Testing and Group Variable Selection

Let X; Z be r and s-dimensional covariates, respectively, used to model the response variable Y as Y = m(X;Z) + \sigma(X;Z)\epsilon. We develop an ANOVA-type test for the null hypothesis that Z has no influence on the regression function,…

Methodology · Statistics 2016-11-11 Adriano Zanin Zambom , Michael G. Akritas

JOINTVIP: Prioritizing variables in observational study design with joint variable importance plot in R

Credible causal effect estimation requires treated subjects and controls to be otherwise similar. In observational settings, such as analysis of electronic health records, this is not guaranteed. Investigators must balance background…

Methodology · Statistics 2024-07-11 Lauren D. Liao , Samuel D. Pimentel

Methods of Selective Inference for Linear Mixed Models: a Review and Empirical Comparison

Selective inference aims at providing valid inference after a data-driven selection of models or hypotheses. It is essential to avoid overconfident results and replicability issues. While significant advances have been made in this area for…

Methodology · Statistics 2025-03-14 Matteo D'Alessandro , Magne Thoresen

Variable selection in linear mixed effects models

This paper is concerned with the selection and estimation of fixed and random effects in linear mixed effects models. We propose a class of nonconcave penalized profile likelihood methods for selecting and estimating important fixed…

Statistics Theory · Mathematics 2012-11-05 Yingying Fan , Runze Li

Using Machine Learning to Test Causal Hypotheses in Conjoint Analysis

Conjoint analysis is a popular experimental design used to measure multidimensional preferences. Researchers examine how varying a factor of interest, while controlling for other relevant factors, influences decision-making. Currently,…

Methodology · Statistics 2024-11-20 Dae Woong Ham , Kosuke Imai , Lucas Janson

Variable selection for general index models via sliced inverse regression

Variable selection, also known as feature selection in machine learning, plays an important role in modeling high dimensional data and is key to data-driven scientific discoveries. We consider here the problem of detecting influential…

Methodology · Statistics 2014-09-24 Bo Jiang , Jun S. Liu

Sequential Advantage Selection for Optimal Treatment Regimes

Variable selection for optimal treatment regime in a clinical trial or an observational study is getting more attention. Most existing variable selection techniques focused on selecting variables that are important for prediction, therefore…

Methodology · Statistics 2014-05-22 Ailin Fan , Wenbin Lu , Rui Song

Importance sampling schemes for evidence approximation in mixture models

The marginal likelihood is a central tool for drawing Bayesian inference about the number of components in mixture models. It is often approximated since the exact form is unavailable. A bias in the approximation may be due to an incomplete…

Computation · Statistics 2014-11-14 Jeong Eun Lee , Christian P. Robert

Significance Tests for Neural Networks

We develop a pivotal test to assess the statistical significance of the feature variables in a single-layer feedforward neural network regression model. We propose a gradient-based test statistic and study its asymptotics using…

Statistics Theory · Mathematics 2020-11-10 Enguerrand Horel , Kay Giesecke

A scalable and efficient covariate selection criterion for mixed effects regression models with unknown random effects structure

We propose a new model selection criterion for mixed effects regression models that is computable when the model is fitted with a two-step method, even when the structure and the distribution of the random effects are unknown. The criterion…

Methodology · Statistics 2018-03-14 Radu V. Craiu , Thierry Duchesne

Selectivity in Probabilistic Causality: Drawing Arrows from Inputs to Stochastic Outputs

Given a set of several inputs into a system (e.g., independent variables characterizing stimuli) and a set of several stochastically non-independent outputs (e.g., random variables describing different aspects of responses), how can one…

Artificial Intelligence · Computer Science 2011-08-30 Ehtibar N. Dzhafarov , Janne V. Kujala

Finding Statistically Significant Attribute Interactions

In many data exploration tasks it is meaningful to identify groups of attribute interactions that are specific to a variable of interest. For instance, in a dataset where the attributes are medical markers and the variable of interest…

Machine Learning · Statistics 2017-03-17 Andreas Henelius , Antti Ukkonen , Kai Puolamäki

Identifying Gene-environment interactions with robust marginal Bayesian variable selection

In high-throughput genetics studies, an important aim is to identify gene-environment interactions associated with the clinical outcomes. Recently, multiple marginal penalization methods have been developed and shown to be effective in…

Methodology · Statistics 2021-02-24 Xi Lu , Kun Fan , Jie Ren , Cen Wu

Selective randomization inference for subgroup effects with continuous biomarkers

Randomization tests are a popular method for testing causal effects in clinical trials with finite-sample validity. In the presence of heterogeneous treatment effects, it is often of interest to select a subgroup that benefits from the…

Methodology · Statistics 2025-04-29 Zijun Gao

Stochastic Search Variable Selection for Bayesian Generalized Linear Mixed Effect Models

Variable selection remains a difficult problem, especially for generalized linear mixed models (GLMMs). While some frequentist approaches to simultaneously select joint fixed and random effects exist, primarily through the use of…

Methodology · Statistics 2024-12-03 Feng Ding , Ian Laga

Feature Selection for multi-labeled variables via Dependency Maximization

Feature selection and reducing the dimensionality of data is an essential step in data analysis. In this work, we propose a new criterion for feature selection that is formulated as conditional information between features given the labeled…

Machine Learning · Statistics 2019-05-20 Salimeh Yasaei Sekeh , Alfred O. Hero

Effect Size Estimation and Misclassification Rate Based Variable Selection in Linear Discriminant Analysis

Supervised classifying of biological samples based on genetic information, (e.g. gene expression profiles) is an important problem in biostatistics. In order to find both accurate and interpretable classification rules variable selection is…

Methodology · Statistics 2012-08-09 Bernd Klaus