English
Related papers

Related papers: Derandomizing Knockoffs

200 papers

Model-X knockoffs is a flexible wrapper method for high-dimensional regression algorithms, which provides guaranteed control of the false discovery rate (FDR). Due to the randomness inherent to the method, different runs of model-X…

Methodology · Statistics 2023-09-01 Zhimei Ren , Rina Foygel Barber

Recently, the scheme of model-X knockoffs was proposed as a promising solution to address controlled feature selection under high-dimensional finite-sample settings. However, the procedure of model-X knockoffs depends heavily on the…

Methodology · Statistics 2022-03-10 Xuebin Zhao , Hong Chen , Yingjie Wang , Weifu Li , Tieliang Gong , Yulong Wang , Feng Zheng

Many contemporary large-scale applications involve building interpretable models linking a large set of potential covariates to a response in a nonlinear fashion, such as when the response is binary. Although this modeling problem has been…

Methodology · Statistics 2017-12-13 Emmanuel Candes , Yingying Fan , Lucas Janson , Jinchi Lv

Model-X knockoffs is a wrapper that transforms essentially any feature importance measure into a variable selection algorithm, which discovers true effects while rigorously controlling the expected fraction of false positives. A frequently…

Methodology · Statistics 2024-03-12 Stephen Bates , Emmanuel Candès , Lucas Janson , Wenshuo Wang

Selecting important features in high-dimensional survival analysis is critical for identifying confirmatory biomarkers while maintaining rigorous error control. In this paper, we propose a derandomized knockoffs procedure for Cox regression…

Methodology · Statistics 2025-12-15 Rui Liu , Nan Sun

The Model-X knockoff procedure has recently emerged as a powerful approach for feature selection with statistical guarantees. The advantage of knockoff is that if we have a good model of the features X, then we can identify salient features…

Machine Learning · Statistics 2019-05-30 Jaime Roquero Gimenez , James Zou

This paper introduces a machine for sampling approximate model-X knockoffs for arbitrary and unspecified data distributions using deep generative models. The main idea is to iteratively refine a knockoff sampling mechanism until a criterion…

Methodology · Statistics 2020-03-03 Yaniv Romano , Matteo Sesia , Emmanuel J. Candès

We present a novel method for controlling the $k$-familywise error rate ($k$-FWER) in the linear regression setting using the knockoffs framework first introduced by Barber and Cand\`es. Our procedure, which we also refer to as knockoffs,…

Methodology · Statistics 2015-11-10 Lucas Janson , Weijie Su

Model-X knockoff framework offers a model-free variable selection method that ensures finite sample false discovery rate (FDR) control. However, the complexity of generating knockoff variables, coupled with the model-free assumption,…

Methodology · Statistics 2025-06-12 Zhanrui Cai , Yingying Fan , Lan Gao

Model-free knockoffs is a recently proposed technique for identifying covariates that is likely to have an effect on a response variable. The method is an efficient method to control the false discovery rate in hypothesis tests for separate…

Methodology · Statistics 2019-03-29 Lars Holden , Kristoffer Hellton

Model-X knockoff has garnered significant attention among various feature selection methods due to its guarantees for controlling the false discovery rate (FDR). Since its introduction in parametric design, knockoff techniques have evolved…

Machine Learning · Computer Science 2024-11-11 Hongyu Shen , Yici Yan , Zhizhen Zhao

A new statistical procedure (Model-X \cite{candes2018}) has provided a way to identify important factors using any supervised learning method controlling for FDR. This line of research has shown great potential to expand the horizon of…

Methodology · Statistics 2018-10-01 Ying Liu , Cheng Zheng

The recently proposed fixed-X knockoff is a powerful variable selection procedure that controls the false discovery rate (FDR) in any finite-sample setting, yet its theoretical insights are difficult to show beyond Gaussian linear models.…

Methodology · Statistics 2023-11-28 Han Su , Panxu Yuan , Qingyang Sun , Mengxi Yi , Gaorong Li

Controlling the False Discovery Rate (FDR) is critical for reproducible variable selection, especially given the prevalence of complex predictive modeling. The recent Split Knockoff method, an extension of the canonical Knockoffs framework,…

Methodology · Statistics 2025-09-05 Yang Cao , Hangyu Lin , Xinwei Sun , Yuan Yao

We consider the variable selection problem, which seeks to identify important variables influencing a response $Y$ out of many candidate features $X_1, \ldots, X_p$. We wish to do so while offering finite-sample guarantees about the…

Methodology · Statistics 2019-02-12 Rina Foygel Barber , Emmanuel J. Candès , Richard J. Samworth

We investigate the robustness of the model-X knockoffs framework with respect to the misspecified or estimated feature distribution. We achieve such a goal by theoretically studying the feature selection performance of a practically…

Methodology · Statistics 2024-06-06 Yingying Fan , Lan Gao , Jinchi Lv

The recent paper Cand\`es et al. (2018) introduced model-X knockoffs, a method for variable selection that provably and non-asymptotically controls the false discovery rate with no restrictions or assumptions on the dimensionality of the…

Methodology · Statistics 2020-06-16 Dongming Huang , Lucas Janson

Variable selection properties of procedures utilizing penalized-likelihood estimates is a central topic in the study of high dimensional linear regression problems. Existing literature emphasizes the quality of ranking of the variables by…

Statistics Theory · Mathematics 2022-04-28 Asaf Weinstein , Weijie J. Su , Małgorzata Bogdan , Rina F. Barber , Emmanuel J. Candès

Model-X knockoffs allows analysts to perform feature selection using almost any machine learning algorithm while still provably controlling the expected proportion of false discoveries. To apply model-X knockoffs, one must construct…

Methodology · Statistics 2021-06-30 Asher Spector , Lucas Janson

This paper develops a framework for testing for associations in a possibly high-dimensional linear model where the number of features/variables may far exceed the number of observational units. In this framework, the observations are split…

Methodology · Statistics 2018-05-04 Rina Foygel Barber , Emmanuel J. Candes
‹ Prev 1 2 3 10 Next ›