Related papers: Bounded P-values in Parametric Programming-based S…

More Powerful Conditional Selective Inference for Generalized Lasso by Parametric Programming

Conditional selective inference (SI) has been studied intensively as a new statistical inference framework for data-driven hypotheses. The basic concept of conditional SI is to make the inference conditional on the selection event, which…

Machine Learning · Statistics 2022-12-15 Vo Nguyen Le Duy , Ichiro Takeuchi

Parametric Programming Approach for More Powerful and General Lasso Selective Inference

Selective Inference (SI) has been actively studied in the past few years for conducting inference on the features of linear models that are adaptively selected by feature selection methods such as Lasso. The basic idea of SI is to make…

Machine Learning · Statistics 2021-02-23 Vo Nguyen Le Duy , Ichiro Takeuchi

More Powerful and General Selective Inference for Stepwise Feature Selection using the Homotopy Continuation Approach

Conditional selective inference (SI) has been actively studied as a new statistical inference framework for data-driven hypotheses. The basic idea of conditional SI is to make inferences conditional on the selection event characterized by a…

Machine Learning · Statistics 2021-04-23 Kazuya Sugiyama , Vo Nguyen Le Duy , Ichiro Takeuchi

Computing Valid p-value for Optimal Changepoint by Selective Inference using Dynamic Programming

There is a vast body of literature related to methods for detecting changepoints (CP). However, less attention has been paid to assessing the statistical reliability of the detected CPs. In this paper, we introduce a novel method to perform…

Machine Learning · Statistics 2021-02-23 Vo Nguyen Le Duy , Hiroki Toda , Ryota Sugiyama , Ichiro Takeuchi

Testing Conditional Independence in Supervised Learning Algorithms

We propose the conditional predictive impact (CPI), a consistent and unbiased estimator of the association between one or several features and a given outcome, conditional on a reduced feature set. Building on the knockoff framework of…

Methodology · Statistics 2021-05-14 David S. Watson , Marvin N. Wright

Prediction-Powered Causal Inferences

In many scientific experiments, the data annotating cost constraints the pace for testing novel hypotheses. Yet, modern machine learning pipelines offer a promising solution, provided their predictions yield correct conclusions. We focus on…

Machine Learning · Computer Science 2025-10-27 Riccardo Cadei , Ilker Demirel , Piersilvio De Bartolomeis , Lukas Lindorfer , Sylvia Cremer , Cordelia Schmid , Francesco Locatello

Verifying Performance Properties of Probabilistic Inference

In this extended abstract, we discuss the opportunity to formally verify that inference systems for probabilistic programming guarantee good performance. In particular, we focus on hybrid inference systems that combine exact and approximate…

Programming Languages · Computer Science 2023-07-17 Eric Atkinson , Ellie Y. Cheng , Guillaume Baudart , Louis Mandel , Michael Carbin

Dimension Reduction for Conditional Density Estimation with Applications to High-Dimensional Causal Inference

We propose a novel and computationally efficient approach for nonparametric conditional density estimation in high-dimensional settings that achieves dimension reduction without imposing restrictive distributional or functional form…

Econometrics · Economics 2025-10-14 Jianhua Mei , Fu Ouyang , Thomas T. Yang

More Powerful Selective Kernel Tests for Feature Selection

Refining one's hypotheses in the light of data is a common scientific practice; however, the dependency on the data introduces selection bias and can lead to specious statistical analysis. An approach for addressing this is via conditioning…

Machine Learning · Computer Science 2020-03-03 Jen Ning Lim , Makoto Yamada , Wittawat Jitkrittum , Yoshikazu Terada , Shigeyuki Matsui , Hidetoshi Shimodaira

Conditional independence testing: a predictive perspective

Conditional independence testing is a key problem required by many machine learning and statistics tools. In particular, it is one way of evaluating the usefulness of some features on a supervised prediction problem. We propose a novel…

Machine Learning · Statistics 2019-08-02 Marco Henrique de Almeida Inácio , Rafael Izbicki , Rafael Bassi Stern

Selective Inference in Propensity Score Analysis

Selective inference (post-selection inference) is a methodology that has attracted much attention in recent years in the fields of statistics and machine learning. Naive inference based on data that are also used for model selection tends…

Methodology · Statistics 2021-11-25 Yoshiyuki Ninomiya , Yuta Umezu , Ichiro Takeuchi

Selection by Prediction with Conformal p-values

Decision making or scientific discovery pipelines such as job hiring and drug discovery often involve multiple stages: before any resource-intensive step, there is often an initial screening that uses predictions from a machine learning…

Methodology · Statistics 2023-05-30 Ying Jin , Emmanuel J. Candès

Probabilistic Shapley Value Modeling and Inference

We propose probabilistic Shapley inference (PSI), a novel probabilistic framework to model and infer sufficient statistics of feature attributions in flexible predictive models, via latent random variables whose mean recovers Shapley…

Machine Learning · Computer Science 2025-09-09 Mert Ketenci , Iñigo Urteaga , Victor Alfonso Rodriguez , Noémie Elhadad , Adler Perotte

Selective inference is easier with p-values

Selective inference is a subfield of statistics that enables valid inference after selection of a data-dependent question. In this paper, we introduce selectively dominant p-values, a class of p-values that allow practitioners to easily…

Methodology · Statistics 2024-11-22 Anav Sood

Improving Power by Conditioning on Less in Post-selection Inference for Changepoints

Post-selection inference has recently been proposed as a way of quantifying uncertainty about detected changepoints. The idea is to run a changepoint detection algorithm, and then re-use the same data to perform a test for a change near…

Methodology · Statistics 2026-05-11 Rachel Carrington , Paul Fearnhead

Local Prediction-Powered Inference

To infer a function value on a specific point $x$, it is essential to assign higher weights to the points closer to $x$, which is called local polynomial / multivariable regression. In many practical cases, a limited sample size may ruin…

Machine Learning · Statistics 2024-09-30 Yanwu Gu , Dong Xia

Bayesian Prediction-Powered Inference

Prediction-powered inference (PPI) is a method that improves statistical estimates based on limited human-labeled data. Specifically, PPI methods provide tighter confidence intervals by combining small amounts of human-labeled data with…

Machine Learning · Computer Science 2024-05-13 R. Alex Hofer , Joshua Maynez , Bhuwan Dhingra , Adam Fisch , Amir Globerson , William W. Cohen

Selective inference after feature selection via multiscale bootstrap

It is common to show the confidence intervals or $p$-values of selected features, or predictor variables in regression, but they often involve selection bias. The selective inference approach solves this bias by conditioning on the…

Methodology · Statistics 2022-06-02 Yoshikazu Terada , Hidetoshi Shimodaira

More powerful post-selection inference, with application to the Lasso

Investigators often use the data to generate interesting hypotheses and then perform inference for the generated hypotheses. P-values and confidence intervals must account for this explorative data analysis. A fruitful method for doing so…

Methodology · Statistics 2018-02-06 Keli Liu , Jelena Markovic , Robert Tibshirani

PAIR-CI: Calibrated Conditional Independence Testing for Causal Discovery with Incomplete Data

The standard constraint-based paradigm for causal discovery with incomplete data -- impute first, test second -- is frequently miscalibrated: any consistent conditional independence (CI) test rejects a true null with probability approaching…

Methodology · Statistics 2026-05-07 Thomas S. Robinson , Ranjit Lall