Related papers: Statistical Analysis of Data Repeatability Measure…

A Supervised Learning Approach to Rankability

The rankability of data is a recently proposed problem that considers the ability of a dataset, represented as a graph, to produce a meaningful ranking of the items it contains. To study this concept, a number of rankability measures have…

Combinatorics · Mathematics 2022-03-15 Nathan McJames , David Malone , Oliver Mason

Ties in ranking scores can be treated as weighted samples

Prior proposals for cumulative statistics suggest making tiny random perturbations to the scores (independent variables in a regression) in order to ensure the scores' uniqueness. Uniqueness means that no score for any member of the…

Methodology · Statistics 2022-08-23 Mark Tygert

Permutation-Based Rank Test in the Presence of Discretization and Application in Causal Discovery with Mixed Data

Recent advances have shown that statistical tests for the rank of cross-covariance matrices play an important role in causal discovery. These rank tests include partial correlation tests as special cases and provide further graphical…

Machine Learning · Computer Science 2025-06-13 Xinshuai Dong , Ignavier Ng , Boyang Sun , Haoyue Dai , Guang-Yuan Hao , Shunxing Fan , Peter Spirtes , Yumou Qiu , Kun Zhang

A classification performance evaluation measure considering data separability

Machine learning and deep learning classification models are data-driven, and the model and the data jointly determine their classification performance. It is biased to evaluate the model's performance only based on the classifier accuracy…

Machine Learning · Computer Science 2022-11-11 Lingyan Xue , Xinyu Zhang , Weidong Jiang , Kai Huo

A First Step Towards Distribution Invariant Regression Metrics

Regression evaluation has been performed for decades. Some metrics have been identified to be robust against shifting and scaling of the data but considering the different distributions of data is much more difficult to address (imbalance…

Machine Learning · Computer Science 2020-09-14 Mario Michael Krell , Bilal Wehbe

Replicability Across Multiple Studies

Meta-analysis is routinely performed in many scientific disciplines. This analysis is attractive since discoveries are possible even when all the individual studies are underpowered. However, the meta-analytic discoveries may be entirely…

Methodology · Statistics 2023-05-09 Marina Bogomolov , Ruth Heller

A statistical framework for planning and analysing test-retest studies for repeatability of quantitative biomarker measurements

There is an increasing number of potential biomarkers that could allow for early assessment of treatment response or disease progression. However, measurements of quantitative biomarkers are subject to random variability. Hence, differences…

Methodology · Statistics 2026-03-02 Moritz Fabian Danzer , Maria Eveslage , Dennis Görlich , Benjamin Noto

Dependence-Robust Inference Using Resampled Statistics

We develop inference procedures robust to general forms of weak dependence. The procedures utilize test statistics constructed by resampling in a manner that does not depend on the unknown correlation structure of the data. We prove that…

Econometrics · Economics 2021-08-26 Michael P. Leung

Set-based differential covariance testing for high-throughput data

The problem of detecting changes in covariance for a single pair of features has been studied in some detail, but may be limited in importance or general applicability. In contrast, testing equality of covariance matrices of a {\it set} of…

Methodology · Statistics 2017-12-12 Yi-Hui Zhou

Hybrid data regression modelling in measurement

Measurement involves the determination of quantitative estimates of physical quantities from experiment, along with estimates of their associated uncertainties. Herewith an experimental system model is the key to extracting information from…

Applications · Statistics 2008-09-01 Vladimir B. Bokov

Rank-transformed subsampling: inference for multiple data splitting and exchangeable p-values

Many testing problems are readily amenable to randomised tests such as those employing data splitting. However despite their usefulness in principle, randomised tests have obvious drawbacks. Firstly, two analyses of the same dataset may…

Methodology · Statistics 2024-09-05 F. Richard Guo , Rajen D. Shah

Selecting the suitable resampling strategy for imbalanced data classification regarding dataset properties

In many application domains such as medicine, information retrieval, cybersecurity, social media, etc., datasets used for inducing classification models often have an unequal distribution of the instances of each class. This situation,…

Machine Learning · Computer Science 2022-01-21 Mohamed S. Kraiem , Fernando Sánchez-Hernández , María N. Moreno-García

Rethink Repeatable Measures of Robot Performance with Statistical Query

For a general standardized testing algorithm designed to evaluate a specific aspect of a robot's performance, several key expectations are commonly imposed. Beyond accuracy (i.e., closeness to a typically unknown ground-truth reference) and…

Robotics · Computer Science 2025-12-22 Bowen Weng , Linda Capito , Guillermo A. Castillo , Dylan Khor

Trustworthy Classification through Rank-Based Conformal Prediction Sets

Machine learning classification tasks often benefit from predicting a set of possible labels with confidence scores to capture uncertainty. However, existing methods struggle with the high-dimensional nature of the data and the lack of…

Machine Learning · Computer Science 2024-07-08 Rui Luo , Zhixin Zhou

Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control

Policy gradient methods in reinforcement learning have become increasingly prevalent for state-of-the-art performance in continuous control tasks. Novel methods typically benchmark against a few key algorithms such as deep deterministic…

Machine Learning · Computer Science 2017-08-15 Riashat Islam , Peter Henderson , Maziar Gomrokchi , Doina Precup

Assessing replicability of findings across two studies of multiple features

Replicability analysis aims to identify the findings that replicated across independent studies that examine the same features. We provide powerful novel replicability analysis procedures for two studies for FWER and for FDR control on the…

Methodology · Statistics 2019-03-01 Marina Bogomolov , Ruth Heller

On the Consistency of Fairness Measurement Methods for Regression Tasks

With growing applications of Machine Learning (ML) techniques in the real world, it is highly important to ensure that these models work in an equitable manner. One main step in ensuring fairness is to effectively measure fairness, and to…

Machine Learning · Computer Science 2024-06-21 Abdalwahab Almajed , Maryam Tabar , Peyman Najafirad

On Rosenbaum's Rank-based Matching Estimator

In two influential contributions, Rosenbaum (2005, 2020) advocated for using the distances between component-wise ranks, instead of the original data values, to measure covariate similarity when constructing matching estimators of average…

Statistics Theory · Mathematics 2024-01-09 Matias D. Cattaneo , Fang Han , Zhexiao Lin

Distance Correlation in Multiple Biased Sampling Models

Testing the independence between random vectors is a fundamental problem in statistics. Distance correlation, a recently popular dependence measure, is universally consistent for testing independence against all distributions with finite…

Methodology · Statistics 2024-08-22 Yuwei Ke , Hok Kan Ling , Yanglei Song

Causal Inference for Experiments with Latent Outcomes: Key Results and Their Implications for Design and Analysis

How should researchers analyze randomized experiments in which the main outcome is latent and measured in multiple ways but each measure contains some degree of error? We first identify a critical study-specific noncomparability problem in…

Econometrics · Economics 2026-01-13 Jiawei Fu , Donald P. Green