Related papers: Pearson-Matthews correlation coefficients for bina…

Weighted MCC: A Robust Measure of Multiclass Classifier Performance for Observations with Individual Weights

Several performance measures are used to evaluate binary and multiclass classification tasks. But individual observations may often have distinct weights, and none of these measures are sensitive to such varying weights. We propose a new…

Machine Learning · Statistics 2025-12-25 Rommel Cortez, Bala Krishnamoorthy

Statistical Inference of the Matthews Correlation Coefficient for Multiclass Classification

Classification problems are essential statistical tasks that form the foundation of decision-making across various fields, including patient prognosis and treatment strategies for critical conditions. Consequently, evaluating the…

Methodology · Statistics 2025-03-11 Jun Tamura, Yuki Itaya, Kenichi Hayashi, Kouji Yamamoto

Asymptotic Properties of Matthews Correlation Coefficient

Evaluating classifications is crucial in statistics and machine learning, as it influences decision-making across various fields, such as patient prognosis and therapy in critical conditions. The Matthews correlation coefficient (MCC) is…

Methodology · Statistics 2024-06-18 Yuki Itaya, Jun Tamura, Kenichi Hayashi, Kouji Yamamoto

A Multi-Way Correlation Coefficient

Pearson's correlation is an important summary measure of the amount of dependence between two variables. It is natural to want to generalise the concept of correlation as a single number that measures the inter-relatedness of three or more…

Methodology · Statistics 2020-03-06 Benjamin M. Taylor

The extension of Pearson correlation coefficient, measuring noise, and selecting features

Not a matter of serious contention, Pearson's correlation coefficient is still the most important statistical association measure. Restricted to just two variables, this measure sometimes doesn't live up to users' needs and expectations.…

Mathematical Finance · Quantitative Finance 2024-02-02 Reza Salimi, Kamran Pakizeh

Assessing Software Defection Prediction Performance: Why Using the Matthews Correlation Coefficient Matters

Context: There is considerable diversity in the range and design of computational experiments to assess classifiers for software defect prediction. This is particularly so, regarding the choice of classifier performance metrics.…

Software Engineering · Computer Science 2020-03-04 Jingxiu Yao, Martin Shepperd

Predictive Data Calibration for Linear Correlation Significance Testing

Inferring linear relationships lies at the heart of many empirical investigations. A measure of linear dependence should correctly evaluate the strength of the relationship as well as qualify whether it is meaningful for the population.…

Methodology · Statistics 2022-08-16 Kaustubh R. Patil, Simon B. Eickhoff, Robert Langner

A unifying view for performance measures in multi-class prediction

In the last few years, many different performance measures have been introduced to overcome the weakness of the most natural metric, the Accuracy. Among them, Matthews Correlation Coefficient has recently gained popularity among researchers…

Machine Learning · Statistics 2012-08-20 Giuseppe Jurman, Cesare Furlanello

Meta-evaluation of comparability metrics using parallel corpora

Metrics for measuring the comparability of corpora or texts need to be developed and evaluated systematically. Applications based on a corpus, such as training Statistical MT systems in specialised narrow domains, require finding a…

Computation and Language · Computer Science 2014-04-16 Bogdan Babych, Anthony Hartley

Metrics for Multi-Class Classification: an Overview

Classification tasks in machine learning involving more than two classes are known by the name of "multi-class classification". Performance indicators are very useful when the aim is to evaluate and compare different classification models…

Machine Learning · Statistics 2020-08-14 Margherita Grandini, Enrico Bagli, Giorgio Visani

Generalized Multiple Correlation Coefficient as a Similarity Measurements between Trajectories

Similarity distance measure between two trajectories is an essential tool to understand patterns in motion, for example, in Human-Robot Interaction or Imitation Learning. The problem has been faced in many fields, from Signal Processing,…

Human-Computer Interaction · Computer Science 2019-07-08 Julen Urain, Jan Peters

On the Importance of Asymmetry and Monotonicity Constraints in Maximal Correlation Analysis

The maximal correlation coefficient is a well-established generalization of the Pearson correlation coefficient for measuring non-linear dependence between random variables. It is appealing from a theoretical standpoint, satisfying…

Information Theory · Computer Science 2019-06-04 Elad Domanovitz, Uri Erez

Notes on the interpretation of dependence measures

Besides the classical distinction of correlation and dependence, many dependence measures bear further pitfalls in their application and interpretation. The aim of this paper is to raise and recall awareness of some of these limitations by…

Methodology · Statistics 2020-04-17 Björn Böttcher

Using functional information for binary classifications

The adequate use of information measured continuously over a period of time represents a methodological challenge. In recent decades, most traditional statistical procedures have been extended to accommodate these…

Methodology · Statistics 2025-12-04 Pablo Martinez-Camblor

Analysis and Comparison of Classification Metrics

A variety of different performance metrics are commonly used in the machine learning literature for the evaluation of classification systems. Some of the most common ones for measuring quality of hard decisions are standard and balanced…

Machine Learning · Computer Science 2023-09-22 Luciana Ferrer

Measures of Correlation for Multiple Variables

Multivariate correlation analysis plays an important role in various fields such as statistics, economics, and big data analytics. In this paper, we propose a pair of measures, the unsigned correlation coefficient (UCC) and the unsigned…

Statistics Theory · Mathematics 2020-01-28 Jianji Wang, Nanning Zheng

Multinomial Multiple Correspondence Analysis

Relations between categorical variables can be analyzed conveniently by multiple correspondence analysis (MCA). It is well suited to discover relations that may exist between categories of different variables. The graphical representation…

Methodology · Statistics 2016-03-11 Patrick J. F. Groenen, Julie Josse

Statistical dependence: Beyond Pearson's $\rho$

Pearson's $\rho$ is the most used measure of statistical dependence. It gives a complete characterization of dependence in the Gaussian case, and it also works well in some non-Gaussian situations. It is well known, however, that it has a…

Statistics Theory · Mathematics 2018-09-28 Dag Tjøstheim, Håkon Otneim, Bård Støve

A probabilistic methodology for multilabel classification

Multilabel classification is a relatively recent subfield of machine learning. Unlike the classical approach, where instances are labeled with only one category, in multilabel classification, an arbitrary number of categories is chosen…

Artificial Intelligence · Computer Science 2013-03-01 Alfonso E. Romero, Luis M. de Campos

Robust performance metrics for imbalanced classification problems

We show that established performance metrics in binary classification, such as the F-score, the Jaccard similarity coefficient or Matthews' correlation coefficient (MCC), are not robust to class imbalance in the sense that if the proportion…

Machine Learning · Statistics 2024-04-12 Hajo Holzmann, Bernhard Klar