Related papers: Understanding parameter differences between analys…

"What is Different Between These Datasets?" A Framework for Explaining Data Distribution Shifts

The performance of machine learning models relies heavily on the quality of input data, yet real-world applications often face significant data-related challenges. A common issue arises when curating training data or deploying models: two…

Machine Learning · Computer Science 2025-09-24 Varun Babbar, Zhicheng Guo, Cynthia Rudin

Statistical Models with Uncertain Error Parameters

In a statistical analysis in Particle Physics, nuisance parameters can be introduced to take into account various types of systematic uncertainties. The best estimate of such a parameter is often modeled as a Gaussian distributed variable…

Data Analysis, Statistics and Probability · Physics 2019-02-25 Glen Cowan

Sensitivity Analysis of the Consistency Assumption

Sensitivity analysis informs causal inference by assessing the sensitivity of conclusions to departures from assumptions. The consistency assumption states that there are no hidden versions of treatment and that the outcome arising…

Methodology · Statistics 2025-12-29 Brian Knaeble, Qinyun Lin, Erich Kummerfeld, Kenneth A. Frank

A variability measure for estimates of parameters in interval data fitting

The paper presents a construction of a quantitative measure of variability for parameter estimates in the data fitting problem under interval uncertainty. It shows the degree of variability and ambiguity of the estimate, and the need for…

Numerical Analysis · Mathematics 2020-03-12 Sergey P. Shary

Elements and Principles for Characterizing Variation between Data Analyses

The data revolution has led to an increased interest in the practice of data analysis. For a given problem, there can be significant or subtle differences in how a data analyst constructs or creates a data analysis, including differences in…

Applications · Statistics 2019-07-29 Stephanie C. Hicks, Roger D. Peng

Evidence-invariant Sensitivity Bounds

The sensitivities revealed by a sensitivity analysis of a probabilistic network typically depend on the entered evidence. For a real-life network therefore, the analysis is performed a number of times, with different evidence. Although…

Artificial Intelligence · Computer Science 2012-07-19 Silja Renooij, Linda C. van der Gaag

On the role of parametrization in models with a misspecified nuisance component

The paper is concerned with inference for a parameter of interest in models that share a common interpretation for that parameter but that may differ appreciably in other respects. We study the general structure of models under which the…

Statistics Theory · Mathematics 2024-08-06 Heather Battey, Nancy Reid

Noise and nonlinearities in high-throughput data

High-throughput data analyses are becoming common in biology, communications, economics and sociology. The vast amounts of data are usually represented in the form of matrices and can be considered as knowledge networks. Spectra-based…

Quantitative Methods · Quantitative Biology 2010-01-06 Viet-Anh Nguyen, Zdena Koukolikova-Nicola, Franco Bagnoli, Pietro Lio

A Comprehensive Framework for Statistical Inference in Measurement System Assessment Studies

Measurement system analysis aims to quantify the variability in data attributable to the measurement system and evaluate its contribution to overall data variability. This paper conducts a rigorous theoretical investigation of the…

Applications · Statistics 2025-01-31 Banafsheh Lashkari, Shojaeddin Chenouri

Statistical methods: Basic concepts, interpretations, and cautions

The study of associations and their causal explanations is a central research activity whose methodology varies tremendously across fields. Even within specialized subfields, comparisons across textbooks and journals reveals that the basics…

Methodology · Statistics 2025-10-13 Sander Greenland

Model selection in the average of inconsistent data: an analysis of the measured Planck-constant values

When the data do not conform to the hypothesis of a known sampling-variance, the fitting of a constant to a set of measured values is a long debated problem. Given the data, fitting would require to find what measurand value is the most…

Data Analysis, Statistics and Probability · Physics 2020-07-21 Giovanni Mana, Enrico Massa, Maria Predescu

Asymptotics for estimating a diverging number of parameters -- with and without sparsity

We consider high-dimensional estimation problems where the number of parameters diverges with the sample size. General conditions are established for consistency, uniqueness, and asymptotic normality in both unpenalized and penalized…

Statistics Theory · Mathematics 2025-04-08 Jana Gauss, Thomas Nagler

Toward Falsifying Causal Graphs Using a Permutation-Based Test

Understanding causal relationships among the variables of a system is paramount to explain and control its behavior. For many real-world systems, however, the true causal graph is not readily available and one must resort to predictions…

Machine Learning · Statistics 2024-12-20 Elias Eulig, Atalanti A. Mastakouri, Patrick Blöbaum, Michaela Hardt, Dominik Janzing

Sampling Errors in Nested Sampling Parameter Estimation

Sampling errors in nested sampling parameter estimation differ from those in Bayesian evidence calculation, but have been little studied in the literature. This paper provides the first explanation of the two main sources of sampling errors…

Methodology · Statistics 2018-12-11 Edward Higson, Will Handley, Mike Hobson, Anthony Lasenby

Learning Lie Group Symmetry Transformations with Neural Networks

The problem of detecting and quantifying the presence of symmetries in datasets is useful for model selection, generative modeling, and data analysis, amongst others. While existing methods for hard-coding transformations in neural networks…

Machine Learning · Computer Science 2023-07-06 Alex Gabel, Victoria Klein, Riccardo Valperga, Jeroen S. W. Lamb, Kevin Webster, Rick Quax, Efstratios Gavves

Out-of-distribution generalization under random, dense distributional shifts

Many existing approaches for estimating parameters in settings with distributional shifts operate under an invariance assumption. For example, under covariate shift, it is assumed that $p(y|x)$ remains invariant. We refer to such…

Methodology · Statistics 2025-02-07 Yujin Jeong, Dominik Rothenhäusler

Detecting unusual input to neural networks

Evaluating a neural network on an input that differs markedly from the training data might cause erratic and flawed predictions. We study a method that judges the unusualness of an input by evaluating its informative content compared to the…

Machine Learning · Computer Science 2020-06-16 Jörg Martin, Clemens Elster

Parameter identifiability, parameter estimation and model prediction for differential equation models

Interpreting data with mathematical models is an important aspect of real-world industrial and applied mathematical modeling. Often we are interested to understand the extent to which a particular set of data informs and constrains model…

Methodology · Statistics 2025-03-06 Matthew J Simpson, Ruth E Baker

Detection and inference of changes in high-dimensional linear regression with non-sparse structures

For data segmentation in high-dimensional linear regression settings, the regression parameters are often assumed to be sparse segment-wise, which enables many existing methods to estimate the parameters locally via $\ell_1$-regularised…

Methodology · Statistics 2026-05-08 Haeran Cho, Tobias Kley, Housen Li

Investigating Data Variance in Evaluations of Automatic Machine Translation Metrics

Current practices in metric evaluation focus on one single dataset, e.g., Newstest dataset in each year's WMT Metrics Shared Task. However, in this paper, we qualitatively and quantitatively show that the performances of metrics are…

Computation and Language · Computer Science 2022-04-21 Jiannan Xiang, Huayang Li, Yahui Liu, Lemao Liu, Guoping Huang, Defu Lian, Shuming Shi