Related papers: Statistical inference after variable selection in …

200 papers

Variable selection for regression models plays a key role in the analysis of biomedical data. However, inference after selection is not covered by classical frequentist statistical theory, which assumes a fixed set of covariates in the…

Methodology · Statistics 2021-07-21 Michael Kammer, Daniela Dunkler, Stefan Michiels, Georg Heinze

We develop a post-selection inference method for the Cox proportional hazards model with interval-censored data, which provides asymptotically valid p-values and confidence intervals conditional on the model selected by lasso. The method is…

Methodology · Statistics 2024-01-02 Jianrui Zhang, Chenxi Li, Haolei Weng

For statistical inference on regression models with a diverging number of covariates, the existing literature typically makes sparsity assumptions on the inverse of the Fisher information matrix. Such assumptions, however, are often…

Methodology · Statistics 2021-06-08 Lu Xia, Bin Nan, Yi Li

We develop a general approach to valid inference after model selection. At the core of our framework is a result that characterizes the distribution of a post-selection estimator conditioned on the selection event. We specialize the…

Statistics Theory · Mathematics 2016-05-04 Jason D. Lee, Dennis L. Sun, Yuekai Sun, Jonathan E. Taylor

The problem of how to best select variables for confounding adjustment forms one of the key challenges in the evaluation of exposure effects in observational studies, and has been the subject of vigorous recent activity in causal inference.…

Methodology · Statistics 2021-12-02 Kelly Van Lancker, Oliver Dukes, Stijn Vansteelandt

We develop tools to do valid post-selective inference for a family of model selection procedures, including choosing a model via cross-validated Lasso. The tools apply universally when the following random vectors are jointly asymptotically…

Methodology · Statistics 2018-02-13 Jelena Markovic, Lucy Xia, Jonathan Taylor

Fulfilling the promise of precision medicine requires accurately and precisely classifying disease states. For cancer, this includes prediction of survival time from a surfeit of covariates. Such data presents an opportunity for improved…

Applications · Statistics 2017-06-22 Shannon R. McCurdy, Annette Molinaro, Lior Pachter

The analysis of randomized trials with time-to-event endpoints is nearly always plagued by the problem of censoring. As the censoring mechanism is usually unknown, analyses typically employ the assumption of non-informative censoring. While…

Methodology · Statistics 2020-07-17 Kelly Van Lancker, Oliver Dukes, Stijn Vansteelandt

The variable selection problem for the nonlinear Cox regression model is considered. In survival analysis, one main objective is to identify the covariates that are associated with the risk of experiencing the event of interest. The Cox…

Machine Learning · Statistics 2022-11-18 Kexuan Li

We develop methodology for valid inference after variable selection in logistic regression when the responses are partially observed, that is, when one observes a set of error-prone testing outcomes instead of the true values of the…

Methodology · Statistics 2025-04-17 Qinyan Shen, Karl Gregory, Xianzheng Huang

The instability in the selection of models is a major concern with data sets containing a large number of covariates. This paper deals with variable selection methodology in the case of high-dimensional problems where the response variable…

Applications · Statistics 2012-03-23 Marie Walschaerts, Eve Leconte, Philippe Besse

Among the most popular variable selection procedures in high-dimensional regression, Lasso provides a solution path to rank the variables and determines a cut-off position on the path to select variables and estimate coefficients. In this…

Methodology · Statistics 2018-06-19 X. Jessie Jeng, Huimin Peng, Wenbin Lu

Selective inference methods are developed for group lasso estimators for use with a wide class of distributions and loss functions. The method includes the use of exponential family distributions, as well as quasi-likelihood modeling for…

Methodology · Statistics 2024-03-28 Yiling Huang, Sarah Pirenne, Snigdha Panigrahi, Gerda Claeskens

We propose robust methods for inference on the effect of a treatment variable on a scalar outcome in the presence of very many controls. Our setting is a partially linear model with possibly non-Gaussian and heteroscedastic disturbances.…

Methodology · Statistics 2017-10-05 Alexandre Belloni, Victor Chernozhukov, Christian Hansen

The Tweedie exponential dispersion family is a popular choice among many to model insurance losses that consist of zero-inflated semicontinuous data. In such data, it is often important to obtain credibility (inference) of the most…

Methodology · Statistics 2025-07-17 Alokesh Manna, Zijian Huang, Dipak K. Dey, Yuwen Gu, Robin He

This thesis studies two problems in modern statistics. First, we study selective inference, or inference for hypotheses that are chosen after looking at the data. The motivating application is inference for regression coefficients selected by…

Machine Learning · Statistics 2015-07-02 Jason D. Lee

Selective inference (post-selection inference) is a methodology that has attracted much attention in recent years in the fields of statistics and machine learning. Naive inference based on data that are also used for model selection tends…

Methodology · Statistics 2021-11-25 Yoshiyuki Ninomiya, Yuta Umezu, Ichiro Takeuchi

We consider high-dimensional inference for potentially misspecified Cox proportional hazards models based on low-dimensional results by Lin and Wei [1989]. A de-sparsified Lasso estimator is proposed based on the log partial likelihood…

Statistics Theory · Mathematics 2018-11-02 Shengchun Kong, Zhuqing Yu, Xianyang Zhang, Guang Cheng

When the model is not known and parameter testing or interval estimation is conducted after model selection, it is necessary to consider selective inference. This paper discusses this issue in the context of sparse estimation. Firstly, we…

Methodology · Statistics 2023-10-12 Joe Suzuki

We develop a set of variable selection methods for the Cox model under interval censoring, in the ultra-high dimensional setting where the dimensionality can grow exponentially with the sample size. The methods select covariates via a…

Methodology · Statistics 2024-05-03 Daewoo Pak, Jianrui Zhang, Di Wu, Haolei Weng, Chenxi Li