English
Related papers

Related papers: Error-based Knockoffs Inference for Controlled Fea…

200 papers

Although there is a huge literature on feature selection for the Cox model, none of the existing approaches can control the false discovery rate (FDR) unless the sample size tends to infinity. In addition, there is no formal power analysis…

Methodology · Statistics 2023-08-02 Daoji Li , Jinzhao Yu , Hui Zhao

Model-X knockoffs is a flexible wrapper method for high-dimensional regression algorithms, which provides guaranteed control of the false discovery rate (FDR). Due to the randomness inherent to the method, different runs of model-X…

Methodology · Statistics 2023-09-01 Zhimei Ren , Rina Foygel Barber

The knockoffs is a recently proposed powerful framework that effectively controls the false discovery rate (FDR) for variable selection. However, none of the existing knockoff solutions are directly suited to handle multivariate or…

Methodology · Statistics 2024-06-28 Xinghao Qiao , Mingya Long , Qizhai Li

Thanks to its fine balance between model flexibility and interpretability, the nonparametric additive model has been widely used, and variable selection for this type of model has been frequently studied. However, none of the existing…

Methodology · Statistics 2022-01-10 Xiaowu Dai , Xiang Lyu , Lexin Li

Model-X knockoffs is a general procedure that can leverage any feature importance measure to produce a variable selection algorithm, which discovers true effects while rigorously controlling the number or fraction of false positives.…

Methodology · Statistics 2020-12-07 Zhimei Ren , Yuting Wei , Emmanuel Candès

This paper proposes a model-free and data-adaptive feature screening method for ultra-high dimensional datasets. The proposed method is based on the projection correlation which measures the dependence between two random vectors. This…

Methodology · Statistics 2021-02-16 Wanjun Liu , Yuan Ke , Jingyuan Liu , Runze Li

Model-X knockoff has garnered significant attention among various feature selection methods due to its guarantees for controlling the false discovery rate (FDR). Since its introduction in parametric design, knockoff techniques have evolved…

Machine Learning · Computer Science 2024-11-11 Hongyu Shen , Yici Yan , Zhizhen Zhao

We consider the variable selection problem, which seeks to identify important variables influencing a response $Y$ out of many candidate features $X_1, \ldots, X_p$. We wish to do so while offering finite-sample guarantees about the…

Methodology · Statistics 2019-02-12 Rina Foygel Barber , Emmanuel J. Candès , Richard J. Samworth

The recently proposed fixed-X knockoff is a powerful variable selection procedure that controls the false discovery rate (FDR) in any finite-sample setting, yet its theoretical insights are difficult to show beyond Gaussian linear models.…

Methodology · Statistics 2023-11-28 Han Su , Panxu Yuan , Qingyang Sun , Mengxi Yi , Gaorong Li

We investigate the robustness of the model-X knockoffs framework with respect to the misspecified or estimated feature distribution. We achieve such a goal by theoretically studying the feature selection performance of a practically…

Methodology · Statistics 2024-06-06 Yingying Fan , Lan Gao , Jinchi Lv

In many fields of science, we observe a response variable together with a large number of potential explanatory variables, and would like to be able to discover which variables are truly associated with the response. At the same time, we…

Methodology · Statistics 2015-10-15 Rina Foygel Barber , Emmanuel J. Candès

Many contemporary large-scale applications involve building interpretable models linking a large set of potential covariates to a response in a nonlinear fashion, such as when the response is binary. Although this modeling problem has been…

Methodology · Statistics 2017-12-13 Emmanuel Candes , Yingying Fan , Lucas Janson , Jinchi Lv

A new statistical procedure (Model-X \cite{candes2018}) has provided a way to identify important factors using any supervised learning method controlling for FDR. This line of research has shown great potential to expand the horizon of…

Methodology · Statistics 2018-10-01 Ying Liu , Cheng Zheng

We present a novel method for controlling the $k$-familywise error rate ($k$-FWER) in the linear regression setting using the knockoffs framework first introduced by Barber and Cand\`es. Our procedure, which we also refer to as knockoffs,…

Methodology · Statistics 2015-11-10 Lucas Janson , Weijie Su

Continuous improvement in medical imaging techniques allows the acquisition of higher-resolution images. When these are used in a predictive setting, a greater number of explanatory variables are potentially related to the dependent…

Statistics Theory · Mathematics 2019-03-13 Tuan-Binh Nguyen , Jérôme-Alexis Chevalier , Bertrand Thirion

Controlling the False Discovery Rate (FDR) is critical for reproducible variable selection, especially given the prevalence of complex predictive modeling. The recent Split Knockoff method, an extension of the canonical Knockoffs framework,…

Methodology · Statistics 2025-09-05 Yang Cao , Hangyu Lin , Xinwei Sun , Yuan Yao

Barber and Candes recently introduced a feature selection method called knockoff+ that controls the false discovery rate (FDR) among the selected features in the classical linear regression problem. Knockoff+ uses the competition between…

Methodology · Statistics 2019-11-25 Kristen Emery , Uri Keich

The Model-X knockoff procedure has recently emerged as a powerful approach for feature selection with statistical guarantees. The advantage of knockoff is that if we have a good model of the features X, then we can identify salient features…

Machine Learning · Statistics 2019-05-30 Jaime Roquero Gimenez , James Zou

This paper develops a framework for testing for associations in a possibly high-dimensional linear model where the number of features/variables may far exceed the number of observational units. In this framework, the observations are split…

Methodology · Statistics 2018-05-04 Rina Foygel Barber , Emmanuel J. Candes

Controlling false discovery rate (FDR) is crucial for variable selection, multiple testing, among other signal detection problems. In literature, there is certainly no shortage of FDR control strategies when selecting individual features,…

Methodology · Statistics 2022-04-11 Jingyuan Liu , Ao Sun , Yuan Ke
‹ Prev 1 2 3 10 Next ›