Related papers: p-Values for Model Evaluation

Divergence vs. Decision P-values: A Distinction Worth Making in Theory and Keeping in Practice

There are two distinct definitions of 'P-value' for evaluating a proposed hypothesis or model for the process generating an observed dataset. The original definition starts with a measure of the divergence of the dataset from what was…

Other Statistics · Statistics 2023-09-25 Sander Greenland

Invariant $P$-values for model checking

$P$-values have been the focus of considerable criticism based on various considerations. Still, the $P$-value represents one of the most commonly used statistical tools. When assessing the suitability of a single hypothesized distribution,…

Statistics Theory · Mathematics 2010-01-13 Michael Evans , Gun Ho Jang

p-Value as the Strength of Evidence Measured by Confidence Distribution

The notion of p-value is a fundamental concept in statistical inference and has been widely used for reporting outcomes of hypothesis tests. However, p-value is often misinterpreted, misused or miscommunicated in practice. Part of the issue…

Methodology · Statistics 2020-02-03 Sifan Liu , Regina Liu , Min-ge Xie

A note on p-values interpreted as plausibilities

P-values are a mainstay in statistics but are often misinterpreted. We propose a new interpretation of p-value as a meaningful plausibility, where this is to be interpreted formally within the inferential model framework. We show that, for…

Statistics Theory · Mathematics 2014-10-28 Ryan Martin , Chuanhai Liu

A Goodness-of-Fit Test for Statistical Models

Statistical modeling plays a fundamental role in understanding the underlying mechanism of massive data (statistical inference) and predicting the future (statistical prediction). Although all models are wrong, researchers try their best to…

Methodology · Statistics 2020-06-17 Hangjin Jiang

On $p$-values

Models are consistently treated as approximations and all procedures are consistent with this. They do not treat the model as being true. In this context $p$-values are one measure of approximation, a small $p$-value indicating a poor…

Other Statistics · Statistics 2016-11-21 Laurie Davies

P-values: misunderstood and misused

P-values are widely used in both the social and natural sciences to quantify the statistical significance of observed results. The recent surge of big data research has made the p-value an even more popular tool to test the significance of…

Applications · Statistics 2023-01-05 Bertie Vidgen , Taha Yasseri

Model selection in the average of inconsistent data: an analysis of the measured Planck-constant values

When the data do not conform to the hypothesis of a known sampling-variance, the fitting of a constant to a set of measured values is a long debated problem. Given the data, fitting would require to find what measurand value is the most…

Data Analysis, Statistics and Probability · Physics 2020-07-21 Giovanni Mana , Enrico Massa , Maria Predescu

Les p-values comme votes d'experts

The p-values are often implicitly used as a measure of evidence for the hypotheses of the tests. This practice has been analyzed with different approaches. It is generally accepted for the one-sided hypothesis problem, but it is often…

Statistics Theory · Mathematics 2007-06-13 Guy Morel

Post-Processing Posterior Predictive P-values

This article addresses issues of model criticism and model comparison in Bayesian contexts, and focusses on the use of the so-called posterior predictive p-values (ppp values). These involve a general discrepancy or conflict measure and…

Methodology · Statistics 2026-05-26 Nils Lid Hjort , Fredrik A. Dahl , Gunnhildur Högnadóttir Steinbakk

To P or not to P: on the evidential nature of P-values and their place in scientific inference

The customary use of P-values in scientific research has been attacked as being ill-conceived, and the utility of P-values has been derided. This paper reviews common misconceptions about P-values and their alleged deficits as indices of…

Methodology · Statistics 2013-11-04 Michael J. Lew

Testing with p*-values: Between p-values, mid p-values, and e-values

We introduce the notion of p*-values (p*-variables), which generalizes p-values (p-variables) in several senses. The new notion has four natural interpretations: operational, probabilistic, Bayesian, and frequentist. A main example of a…

Statistics Theory · Mathematics 2022-02-24 Ruodu Wang

Bayesian model checking: A comparison of tests

Two procedures for checking Bayesian models are compared using a simple test problem based on the local Hubble expansion. Over four orders of magnitude, p-values derived from a global goodness-of-fit criterion for posterior probability…

Instrumentation and Methods for Astrophysics · Physics 2018-06-27 Leon B. Lucy

Feature Selection using e-values

In the context of supervised parametric models, we introduce the concept of e-values. An e-value is a scalar quantity that represents the proximity of the sampling distribution of parameter estimates in a model trained on a subset of…

Machine Learning · Statistics 2022-07-19 Subhabrata Majumdar , Snigdhansu Chatterjee

On goodness-of-fit tests for arbitrary multivariate models

Goodness-of-fit tests are often used in data analysis to test the agreement of a distribution to a set of data. These tests can be used to detect an unknown signal against a known background or to set limits on a proposed signal…

Methodology · Statistics 2023-03-20 Lolian Shtembari , Allen Caldwell

Nonuniformity of P-values Can Occur Early in Diverging Dimensions

Evaluating the joint significance of covariates is of fundamental importance in a wide range of applications. To this end, p-values are frequently employed and produced by algorithms that are powered by classical large-sample asymptotic…

Methodology · Statistics 2017-05-11 Yingying Fan , Emre Demirkaya , Jinchi Lv

Assessment of P-value variability in the current replicability crisis

Increased availability of data and accessibility of computational tools in recent years have created unprecedented opportunities for scientific research driven by statistical analysis. Inherent limitations of statistics impose constrains on…

Genomics · Quantitative Biology 2016-09-13 Olga A. Vsevolozhskaya , Gabriel Ruiz , Dmitri V. Zaykin

P-value: A Bless or A Curse for Evidence-Based Studies?

As a convention, p-value is often computed in frequentist hypothesis testing and compared with the nominal significance level of 0.05 to determine whether or not to reject the null hypothesis. The smaller the p-value, the more significant…

Methodology · Statistics 2020-02-25 Haolun Shi , Guosheng Yin

P-values for classification

Let $(X,Y)$ be a random variable consisting of an observed feature vector $X\in \mathcal{X}$ and an unobserved class label $Y\in \{1,2,...,L\}$ with unknown joint distribution. In addition, let $\mathcal{D}$ be a training data set…

Statistics Theory · Mathematics 2008-06-26 Lutz Duembgen , Bernd-Wolfgang Igl , Axel Munk

Thou Shalt Not Reject the P-value

Since its debut in the 18th century, the P-value has been an important part of hypothesis testing-based scientific discoveries. As the statistical engine accelerates, questions are beginning to be raised, asking to what extent scientific…

Methodology · Statistics 2022-07-29 Oliver Y. Chén , Raúl G. Saraiva , Guy Nagels , Huy Phan , Tom Schwantje , Hengyi Cao , Jiangtao Gou , Jenna M. Reinen , Bin Xiong , Bangdong Zhi , Xiaojun Wang , Maarten de Vos