Related papers: Beyond Support in Two-Stage Variable Selection

Improving variable selection properties with data integration and transfer learning

We study variable selection (also called support recovery) in high-dimensional sparse linear regression when one has external information on which variables are likely to be associated with the response. Consistent recovery is only possible…

Statistics Theory · Mathematics 2026-02-16 Paul Rognon-Vael, David Rossell, Piotr Zwiernik

Two-Stage Robust and Sparse Distributed Statistical Inference for Large-Scale Data

In this paper, we address the problem of conducting statistical inference in settings involving large-scale data that may be high-dimensional and contaminated by outliers. The high volume and dimensionality of the data require distributed…

Machine Learning · Statistics 2022-11-30 Emadaldin Mozafari-Majd, Visa Koivunen

A Two-Stage Variable Selection Approach for Correlated High Dimensional Predictors

When fitting statistical models, some predictors are often found to be correlated with each other, and functioning together. Many group variable selection methods are developed to select the groups of predictors that are closely related to…

Methodology · Statistics 2021-03-25 Zhiyuan Li

Multistage Adaptive Estimation of Sparse Signals

This paper considers sequential adaptive estimation of sparse signals under a constraint on the total sensing effort. The advantage of adaptivity in this context is the ability to focus more resources on regions of space where signal…

Methodology · Statistics 2013-04-03 Dennis Wei, Alfred O. Hero

A High-dimensional M-estimator Framework for Bi-level Variable Selection

In high-dimensional data analysis, bi-level sparsity is often assumed when covariates function group-wisely and sparsity can appear either at the group level or within certain groups. In such cases, an ideal model should be able to…

Methodology · Statistics 2021-09-14 Bin Luo, Xiaoli Gao

Two-Stage Testing in a high dimensional setting

In a high dimensional regression setting in which the number of variables ($p$) is much larger than the sample size ($n$), the number of possible two-way interactions between the variables is immense. If the number of variables is in the…

Methodology · Statistics 2024-06-26 Marianne A Jonker, Luc van Schijndel, Eric Cator

Selection of variables and decision boundaries for functional data via bi-level selection

Sparsity-inducing penalties are useful tools for variable selection and they are also effective for regression settings where the data are functions. We consider the problem of selecting not only variables but also decision boundaries in…

Methodology · Statistics 2020-06-01 Hidetoshi Matsui

Analyzing two-stage experiments in the presence of interference

Two-stage randomization is a powerful design for estimating treatment effects in the presence of interference; that is, when one individual's treatment assignment affects another individual's outcomes. Our motivating example is a two-stage…

Applications · Statistics 2017-05-02 Guillaume Basse, Avi Feller

Inference for relative sparsity

In healthcare, there is much interest in estimating policies, or mappings from covariates to treatment decisions. Recently, there is also interest in constraining these estimated policies to the standard of care, which generated the…

Methodology · Statistics 2026-02-17 Samuel J. Weisenthal, Sally W. Thurston, Ashkan Ertefaie

Variable Selection in Causal Inference Using Penalization

In the causal adjustment setting, variable selection techniques based on either the outcome or treatment allocation model can result in the omission of confounders or the inclusion of spurious variables in the propensity score. We propose a…

Statistics Theory · Mathematics 2014-06-06 Ashkan Ertefaie, Masoud Asgharian, David A. Stephens

Establishment and Solution of a Multi-Stage Decision Model Based on Hypothesis Testing and Dynamic Programming Algorithm

This paper introduces a novel multi-stage decision-making model that integrates hypothesis testing and dynamic programming algorithms to address complex decision-making scenarios. Initially, we develop a sampling inspection scheme that…

Systems and Control · Electrical Eng. & Systems 2025-03-11 Ziyang Liu, Yurui Hu, Yihan Deng

Projective Inference in High-dimensional Problems: Prediction and Feature Selection

This paper discusses predictive inference and feature selection for generalized linear models with scarce but high-dimensional data. We argue that in many cases one can benefit from a decision theoretically justified two-stage approach:…

Machine Learning · Statistics 2020-11-09 Juho Piironen, Markus Paasiniemi, Aki Vehtari

Asymptotic Inference for Multi-Stage Stationary Treatment Policy with Variable Selection

Dynamic treatment regimes or policies are a sequence of decision functions over multiple stages that are tailored to individual features. One important class of treatment policies in practice, namely multi-stage stationary treatment…

Machine Learning · Statistics 2025-01-09 Daiqi Gao, Yufeng Liu, Donglin Zeng

Optimal Two-Step Prediction in Regression

High-dimensional prediction typically comprises two steps: variable selection and subsequent least-squares refitting on the selected variables. However, the standard variable selection procedures, such as the lasso, hinge on tuning…

Methodology · Statistics 2017-06-07 Didier Chételat, Johannes Lederer, Joseph Salmon

Accurate Inference for Penalized Logistic Regression

Inference for high-dimensional logistic regression models using penalized methods has been a challenging research problem. As an illustration, a major difficulty is the significant bias of the Lasso estimator, which limits its direct…

Methodology · Statistics 2024-10-29 Yuming Zhang, Stéphane Guerrier, Runze Li

Inference for Two-Stage Extremum Estimators

We present a simulation-based inference approach for two-stage estimators, focusing on extremum estimators in the second stage. We accommodate a broad range of first-stage estimators, including extremum estimators, high-dimensional…

Econometrics · Economics 2024-11-08 Aristide Houndetoungan, Abdoul Haki Maoude

Adaptive Two-stage Stochastic Programming with an Analysis on Capacity Expansion Planning Problem

Multi-stage stochastic programming is a well-established framework for sequential decision making under uncertainty by seeking policies that are fully adapted to the uncertainty. Often such flexible policies are not desirable, and the…

Optimization and Control · Mathematics 2024-08-06 Beste Basciftci, Shabbir Ahmed, Nagi Gebraeel

Weak Signal Identification and Inference in Penalized Model Selection

Weak signal identification and inference are very important in the area of penalized model selection, yet they are under-developed and not well-studied. Existing inference procedures for penalized estimators are mainly focused on strong…

Methodology · Statistics 2016-11-16 Peibei Shi, Annie Qu

Bayesian Stability Selection and Inference on Selection Probabilities

Stability selection is a versatile framework for structure estimation and variable selection in high-dimensional setting, primarily grounded in frequentist principles. In this paper, we propose an enhanced methodology that integrates…

Methodology · Statistics 2026-05-05 Mahdi Nouraie, Connor Smith, Samuel Muller

Maximum Score Estimation of Preference Parameters for a Binary Choice Model under Uncertainty

This paper develops maximum score estimation of preference parameters in the binary choice model under uncertainty in which the decision rule is affected by conditional expectations. The preference parameters are estimated in two stages: we…

Methodology · Statistics 2013-12-03 Le-Yu Chen, Sokbae Lee, Myung Jae Sung