Related papers: Detecting p-hacking

The Power of Tests for Detecting $p$-Hacking

A flourishing empirical literature investigates the prevalence of $p$-hacking based on the distribution of $p$-values across studies. Interpreting results in this literature requires a careful understanding of the power of methods for…

Econometrics · Economics 2025-08-12 Graham Elliott, Nikolay Kudrin, Kaspar Wüthrich

A Short Note on P-Value Hacking

We present the expected values from p-value hacking as a choice of the minimum p-value among $m$ independent tests, which can be considerably lower than the "true" p-value, even with a single trial, owing to the extreme skewness of the…

Applications · Statistics 2018-01-29 Nassim Nicholas Taleb

When is p-hacking detectable?

We show that some forms of p-hacking cannot be detected by examining the histogram of t-statistics or their p-values. Even when p-hacking is detectable, standard tests may lack power. We propose a novel test that detects every form of…

Econometrics · Economics 2026-05-14 Stefan Faridani

General Behaviour of P-Values Under the Null and Alternative

Hypothesis testing results often rely on simple, yet important assumptions about the behaviour of the distribution of p-values under the null and the alternative. We examine tests for one dimensional parameters of interest that converge to…

Statistics Theory · Mathematics 2021-08-06 Yanbo Tang, Radu Craiu, Lei Sun

Critical Values Robust to P-hacking

P-hacking is prevalent in reality but absent from classical hypothesis testing theory. As a consequence, significant results are much more common than they are supposed to be when the null hypothesis is in fact true. In this paper, we build…

Econometrics · Economics 2024-05-09 Adam McCloskey, Pascal Michaillat

Multiple testing of composite null hypotheses for discrete data using randomized $p$-values

$P$-values that are derived from continuously distributed test statistics are typically uniformly distributed on $(0,1)$ under least favorable parameter configurations (LFCs) in the null hypothesis. Conservativeness of a $p$-value $P$…

Methodology · Statistics 2023-03-13 Daniel Ochieng, Anh-Tuan Hoang, Thorsten Dickhaus

Multiple Test Functions and Adjusted p-Values for Test Statistics with Discrete Distributions

The randomized $p$-value, (nonrandomized) mid-$p$-value and abstract randomized $p$-value have all been recommended for testing a null hypothesis whenever the test statistic has a discrete distribution. This paper provides a unifying…

Computation · Statistics 2014-12-02 Joshua D Habiger

A robust and p-hacking-proof significance test under variance uncertainty

P-hacking poses challenges to traditional hypothesis testing. In this paper, we propose a robust method for the one-sample significance test that can protect against p-hacking from sample manipulation. Precisely, assuming a sequential…

Statistics Theory · Mathematics 2025-02-18 Xifeng Li, Shuzhen Yang, Jianfeng Yao

Randomized p-values for multiple testing and their application in replicability analysis

We are concerned with testing replicability hypotheses for many endpoints simultaneously. This constitutes a multiple test problem with composite null hypotheses. Traditional $p$-values, which are computed under least favourable parameter…

Methodology · Statistics 2020-02-26 Anh-Tuan Hoang, Thorsten Dickhaus

p-Value as the Strength of Evidence Measured by Confidence Distribution

The notion of p-value is a fundamental concept in statistical inference and has been widely used for reporting outcomes of hypothesis tests. However, p-value is often misinterpreted, misused or miscommunicated in practice. Part of the issue…

Methodology · Statistics 2020-02-03 Sifan Liu, Regina Liu, Min-ge Xie

Modelling publication bias and p-hacking

Publication bias and p-hacking are two well-known phenomena that strongly affect the scientific literature and cause severe problems in meta-analyses. Due to these phenomena, the assumptions of meta-analyses are seriously violated and the…

Methodology · Statistics 2020-02-26 Jonas Moss, Riccardo De Bin

Multiple Testing of One-Sided Hypotheses with Conservative $p$-values

We study a large-scale one-sided multiple testing problem in which test statistics follow normal distributions with unit variance, and the goal is to identify signals with positive mean effects. A conventional approach is to compute…

Methodology · Statistics 2026-05-15 Kwangok Seo, Johan Lim, Hyungwon Choi, Jaesik Jeong

Testing for Outliers with Conformal p-values

This paper studies the construction of p-values for nonparametric outlier detection, taking a multiple-testing perspective. The goal is to test whether new independent samples belong to the same distribution as a reference data set or are…

Methodology · Statistics 2024-03-12 Stephen Bates, Emmanuel Candès, Lihua Lei, Yaniv Romano, Matteo Sesia

A new class of nonparametric tests for second-order stochastic dominance based on the Lorenz P-P plot

Given samples from two non-negative random variables, we propose a family of tests for the null hypothesis that one random variable stochastically dominates the other at the second order. Test statistics are obtained as functionals of the…

Statistics Theory · Mathematics 2023-10-16 Tommaso Lando, Sirio Legramanti

Supplemental Studies for Simultaneous Goodness-of-Fit Testing

Testing to see whether a given data set comes from some specified distribution is among the oldest types of problems in Statistics. Many such tests have been developed and their performance studied. The general result has been that while a…

Applications · Statistics 2020-12-07 Wolfgang Rolke

Consistency of $p$-norm based tests in high dimensions: characterization, monotonicity, domination

Many commonly used test statistics are based on a norm measuring the evidence against the null hypothesis. To understand how the choice of a norm affects power properties of tests in high dimensions, we study the consistency sets of…

Statistics Theory · Mathematics 2022-02-01 Anders Bredahl Kock, David Preinerstorfer

Post-hoc $\alpha$ Hypothesis Testing and the Post-hoc $p$-value

In traditional hypothesis testing one must pre-specify the significance level $\alpha$ to bound the 'size' of the test: its probability to falsely reject the hypothesis. Indeed, a data-dependent selection of $\alpha$ would generally distort…

Statistics Theory · Mathematics 2025-12-03 Nick W. Koning

Distribution-Free Pointwise Adjusted P-Values for Functional Hypotheses

Graphical tests assess whether a function of interest departs from an envelope of functions generated under a simulated null distribution. This approach originated in spatial statistics, but has recently gained some popularity in functional…

Methodology · Statistics 2020-06-25 Meng Xu , Philip T. Reiss

Compound p-Value Statistics for Multiple Testing Procedures

Many multiple testing procedures make use of the p-values from the individual pairs of hypothesis tests, and are valid if the p-value statistics are independent and uniformly distributed under the null hypotheses. However, it has recently…

Methodology · Statistics 2011-08-25 Joshua D. Habiger, Edsel A. Pena

Defending the P-value

Attacks on the P-value are nothing new, but the recent attacks are increasingly more serious. They come from more mainstream sources, with widening targets such as a call to retire the significance testing altogether. While well meaning, I…

Other Statistics · Statistics 2022-01-11 Yudi Pawitan