Related papers: Assessing Inference Methods

On the Use of Design-Based Simulations

Design-based simulations - procedures that hold realized outcomes fixed and generate variation by resampling treatment assignment or shocks - are widely used in both methodological and applied work to assess inference procedures. This paper…

Econometrics · Economics 2026-03-13 Bruno Ferman

Randomization Inference Tests for Shift-Share Designs

We consider the problem of inference in shift-share research designs. The choice between existing approaches that allow for unrestricted spatial correlation involves tradeoffs, varying in terms of their validity when there are relatively…

Econometrics · Economics 2022-06-03 Luis Alvarez , Bruno Ferman , Raoni Oliveira

Pitfalls and potentials in simulation studies: Questionable research practices in comparative simulation studies allow for spurious claims of superiority of any method

Comparative simulation studies are workhorse tools for benchmarking statistical methods. As with other empirical studies, the success of simulation studies hinges on the quality of their design, execution and reporting. If not conducted…

Methodology · Statistics 2023-03-10 Samuel Pawel , Lucas Kook , Kelly Reeve

Bridging the Gap Between Methodological Research and Statistical Practice: Toward "Translational Simulation Research

Simulations are valuable tools for empirically evaluating the properties of statistical methods and are primarily employed in methodological research to draw general conclusions about methods. In addition, they can often be useful to…

Other Statistics · Statistics 2025-10-08 Anne-Laure Boulesteix , Patrick Callahan , Luzia Hanssum , Vincent Gaertner , Eva Hoster

On Post-Selection Inference in A/B Tests

When interpreting A/B tests, we typically focus only on the statistically significant results and take them by face value. This practice, termed post-selection inference in the statistical literature, may negatively affect both point…

Applications · Statistics 2021-06-01 Alex Deng , Yicheng Li , Jiannan Lu , Vivek Ramamurthy

Effect Inference from Two-Group Data with Sampling Bias

In many applications, different populations are compared using data that are sampled in a biased manner. Under sampling biases, standard methods that estimate the difference between the population means yield unreliable inferences. Here we…

Statistics Theory · Mathematics 2019-11-12 Dave Zachariah , Petre Stoica

Examining False Positives under Inference Scaling for Mathematical Reasoning

Recent advancements in language models have led to significant improvements in mathematical reasoning across various benchmarks. However, most of these benchmarks rely on automatic evaluation methods that only compare final answers using…

Computation and Language · Computer Science 2025-09-19 Yu Wang , Nan Yang , Liang Wang , Furu Wei , Fuli Feng

The Problem with Assessing Statistical Methods

In this paper, we investigate the problem of assessing statistical methods and effectively summarizing results from simulations. Specifically, we consider problems of the type where multiple methods are compared on a reasonably large test…

Applications · Statistics 2015-10-07 Abigail Arnold , Jason Loeppky

A likelihood approach to proper analysis of secondary outcomes in matched case-control studies

Matched case-control studies are commonly employed in epidemiological research for their convenience and efficiency. Analysis of secondary outcomes can yield valuable insights into biological pathways and help identify genetic variants of…

Methodology · Statistics 2026-02-24 Shanshan Liu , Guoqing Diao

The Limits of Inference Scaling Through Resampling

Recent research has generated hope that inference scaling, such as resampling solutions until they pass verifiers like unit tests, could allow weaker models to match stronger ones. Beyond inference, this approach also enables training…

Machine Learning · Computer Science 2026-03-27 Benedikt Stroebl , Sayash Kapoor , Arvind Narayanan

On the role of benchmarking data sets and simulations in method comparison studies

Method comparisons are essential to provide recommendations and guidance for applied researchers, who often have to choose from a plethora of available approaches. While many comparisons exist in the literature, these are often not neutral…

Methodology · Statistics 2022-12-07 Sarah Friedrich , Tim Friede

Simulation as Experiment: An Empirical Critique of Simulation Research on Recommender Systems

Simulation can enable the study of recommender system (RS) evolution while circumventing many of the issues of empirical longitudinal studies; simulations are comparatively easier to implement, are highly controlled, and pose no ethical…

Computers and Society · Computer Science 2021-08-02 Amy A. Winecoff , Matthew Sun , Eli Lucherini , Arvind Narayanan

Controlling the False Discovery Proportion in Matched Observational Studies

We provide an approach to exploratory data analysis in matched observational studies with a single intervention and multiple endpoints. In such settings, the researcher would like to explore evidence for actual treatment effects among these…

Methodology · Statistics 2025-12-10 Mengqi Lin , Colin Fogarty

Risk management in the use of published statistical results for policy decisions

Statistical inferential results generally come with a measure of reliability for decision-making purposes. For a policy implementer, the value of implementing published policy research depends critically upon this reliability. For a policy…

Other Statistics · Statistics 2024-08-21 Duncan Ermini Leaf

The Sources of Statistical Bias Series: Simulated Demonstrations to Illustrate the Causes and Effects of Biases in Statistical Estimates

When teaching and discussing statistical assumptions, our focus is oftentimes placed on how to test and address potential violations rather than the effects of violating assumptions on the estimates produced by our statistical models. The…

Methodology · Statistics 2022-06-14 Ian A Silver

Toward a practical handbook for choosing among causal inference methods in non-randomized studies with binary outcomes: A simulation study for applied researchers

Applied researchers in biomedicine and related fields are often interested in estimating the causal effect of a treatment or intervention. Although randomized clinical trials are considered the gold standard for establishing causal effects,…

Methodology · Statistics 2026-05-14 Adrián Aurensanz-Crespo , Cristóbal M Rodríguez-Leal , Rosario Susi , Jorge Castillo-Mateo , Jesús Asín , José M Ramírez , Teresa Pérez

Methods of Selective Inference for Linear Mixed Models: a Review and Empirical Comparison

Selective inference aims at providing valid inference after a data-driven selection of models or hypotheses. It is essential to avoid overconfident results and replicability issues. While significant advances have been made in this area for…

Methodology · Statistics 2025-03-14 Matteo D'Alessandro , Magne Thoresen

Simulation-based stacking

Simulation-based inference has been popular for amortized Bayesian computation. It is typical to have more than one posterior approximation, from different inference algorithms, different architectures, or simply the randomness of…

Methodology · Statistics 2024-03-04 Yuling Yao , Bruno Régaldo-Saint Blancard , Justin Domke

Simulation Experiments as a Causal Problem

Simulation methods are among the most ubiquitous methodological tools in statistical science. In particular, statisticians often is simulation to explore properties of statistical functionals in models for which developed statistical theory…

Methodology · Statistics 2023-08-22 Tyrel Stokes , Ian Shrier , Russell Steele

Tracking the risk of a deployed model and detecting harmful distribution shifts

When deployed in the real world, machine learning models inevitably encounter changes in the data distribution, and certain -- but not all -- distribution shifts could result in significant performance degradation. In practice, it may make…

Machine Learning · Statistics 2022-05-06 Aleksandr Podkopaev , Aaditya Ramdas