Related papers: Post-Processing Posterior Predictive P-values

Computational methods for fast Bayesian model assessment via calibrated posterior p-values

Posterior predictive p-values (ppps) have become popular tools for Bayesian model assessment, being general-purpose and easy to use. However, interpretation can be difficult because their distribution is not uniform under the hypothesis…

Methodology · Statistics 2024-02-01 Sally Paganin , Perry de Valpine

Posterior predictive p-values and the convex order

Posterior predictive p-values are a common approach to Bayesian model-checking. This article analyses their frequency behaviour, that is, their distribution when the parameters and the data are drawn from the prior and the model…

Statistics Theory · Mathematics 2015-03-31 Patrick Rubin-Delanchy , Daniel John Lawson

Joint $p$-Values for Higher-Powered Bayesian Model Checking with Frequentist Guarantees

We introduce a joint posterior $p$-value, an extension of the posterior predictive $p$-value for multiple test statistics, designed to address limitations of existing Bayesian $p$-values in the setting of continuous model expansion. In…

Methodology · Statistics 2023-12-13 Collin Cademartori

Asymptotically well-calibrated Bayesian $p$-value using the Kolmogorov-Smirnov statistic

The posterior predictive $p$-value (ppp) is widely used in Bayesian model evaluation. However, due to double use of the data, the ppp may not be a valid $p$-value even in large samples: The asymptotic null distribution of the ppp can be…

Statistics Theory · Mathematics 2026-01-13 Yueming Shen , Surya Tokdar

p-Values for Model Evaluation

Deciding whether a model provides a good description of data is often based on a goodness-of-fit criterion summarized by a p-value. Although there is considerable confusion concerning the meaning of p-values, leading to their misuse, they…

Data Analysis, Statistics and Probability · Physics 2013-05-29 Frederik Beaujean , Allen Caldwell , Daniel Kollar , Kevin Kroeninger

Assessment of P-value variability in the current replicability crisis

Increased availability of data and accessibility of computational tools in recent years have created unprecedented opportunities for scientific research driven by statistical analysis. Inherent limitations of statistics impose constrains on…

Genomics · Quantitative Biology 2016-09-13 Olga A. Vsevolozhskaya , Gabriel Ruiz , Dmitri V. Zaykin

Population Predictive Checks

Bayesian modeling helps applied researchers articulate assumptions about their data and develop models tailored for specific applications. Thanks to good methods for approximate posterior inference, researchers can now easily build, use,…

Methodology · Statistics 2023-11-22 Gemma E. Moran , David M. Blei , Rajesh Ranganath

Invariant $P$-values for model checking

$P$-values have been the focus of considerable criticism based on various considerations. Still, the $P$-value represents one of the most commonly used statistical tools. When assessing the suitability of a single hypothesized distribution,…

Statistics Theory · Mathematics 2010-01-13 Michael Evans , Gun Ho Jang

Towards replicability with confidence intervals for the exceedance probability

Several scientific fields including psychology are undergoing a replication crisis. There are many reasons for this problem, one of which is a misuse of p-values. There are several alternatives to p-values, and in this paper we describe a…

Methodology · Statistics 2020-10-05 Brian D. Segal

The Posterior Predictive Null

Bayesian model criticism is an important part of the practice of Bayesian statistics. Traditionally, model criticism methods have been based on the predictive check, an adaptation of goodness-of-fit testing to Bayesian modeling and an…

Methodology · Statistics 2022-07-08 Gemma E. Moran , John P. Cunningham , David M. Blei

The Letter Pi : Bayesian interpretation of p-values, Reproducibility and Considerations for Replication in the Generalized Linear Model

Significance testing based on p-values has been implicated in the reproducibility crisis in scientific research, with one of the proposals being to eliminate them in favor of Bayesian analyses. Defenders of the p-values have countered that…

Methodology · Statistics 2023-05-02 Christos Argyropoulos , Andy P Grieve

Posterior Predictive Propensity Scores and $p$-Values

\citet{Rosenbaum83ps} introduced the notion of the propensity score and discussed its central role in causal inference with observational studies. Their paper, however, caused a fundamental incoherence with an early paper by…

Methodology · Statistics 2022-03-29 Peng Ding , Tianyu Guo

Bayesian Posterior Interval Calibration to Improve the Interpretability of Observational Studies

Observational healthcare data offer the potential to estimate causal effects of medical products on a large scale. However, the confidence intervals and p-values produced by observational studies only account for random error and fail to…

Applications · Statistics 2024-05-02 Jami J. Mulgrave , David Madigan , George Hripcsak

Testing with p*-values: Between p-values, mid p-values, and e-values

We introduce the notion of p*-values (p*-variables), which generalizes p-values (p-variables) in several senses. The new notion has four natural interpretations: operational, probabilistic, Bayesian, and frequentist. A main example of a…

Statistics Theory · Mathematics 2022-02-24 Ruodu Wang

Limiting behavior of the Jeffreys Power-Expected-Posterior Bayes Factor in Gaussian Linear Models

Expected-posterior priors (EPP) have been proved to be extremely useful for testing hypothesis on the regression coefficients of normal linear models. One of the advantages of using EPPs is that impropriety of baseline priors causes no…

Computation · Statistics 2014-12-02 Dimitris Fouskakis , Ioannis Ntzoufras

Posterior Predictive P-values with Fisher Randomization Tests in Noncompliance Settings: Test Statistics vs Discrepancy Variables

In randomized experiments with noncompliance, tests may focus on compliers rather than on the overall sample. Rubin (1998) put forth such a method, and argued that testing for the complier average causal effect and averaging permutation…

Methodology · Statistics 2016-02-23 Laura Forastiere , Fabrizia Mealli , Luke Miratrix

Bayesian model checking: A comparison of tests

Two procedures for checking Bayesian models are compared using a simple test problem based on the local Hubble expansion. Over four orders of magnitude, p-values derived from a global goodness-of-fit criterion for posterior probability…

Instrumentation and Methods for Astrophysics · Physics 2018-06-27 Leon B. Lucy

Randomized Predictive P-values: A Versatile Model Diagnostic Tool with Unified Reference Distribution

Examining residuals such as Pearson and deviance residuals, is a standard tool for assessing normal regression. However, for discrete response, these residuals cluster on lines corresponding to distinct response values. Their distributions…

Methodology · Statistics 2020-07-07 Cindy Feng , Alireza Sadeghpour , Longhai Li

Meta-Uncertainty in Bayesian Model Comparison

Bayesian model comparison (BMC) offers a principled probabilistic approach to study and rank competing models. In standard BMC, we construct a discrete probability distribution over the set of possible models, conditional on the observed…

Machine Learning · Statistics 2023-02-22 Marvin Schmitt , Stefan T. Radev , Paul-Christian Bürkner

Power-Expected-Posterior Priors as Mixtures of g-Priors

One of the main approaches used to construct prior distributions for objective Bayes methods is the concept of random imaginary observations. Under this setup, the expected-posterior prior (EPP) offers several advantages, among which it has…

Methodology · Statistics 2020-10-09 Dimitris Fouskakis , Ioannis Ntzoufras