English
Related papers

Related papers: Pairwise Adjusted Mutual Information

200 papers

We introduce resampled mutual information (ResMI), a novel measure of clustering similarity that combines insights from information theoretic and pair counting approaches to clustering and community detection. Similar to chance-corrected…

Social and Information Networks · Computer Science 2024-12-06 Cheaheon Lim

Mutual information is commonly used as a measure of similarity between competing labelings of a given set of objects, for example to quantify performance in classification and community detection tasks. As argued recently, however, the…

Social and Information Networks · Computer Science 2025-07-17 Maximilian Jerdee , Alec Kirkley , M. E. J. Newman

Comparing clusterings is central to evaluating unsupervised models, yet the many existing similarity measures can produce widely divergent, sometimes contradictory, evaluations. Clustering similarity measures are typically organized into…

Machine Learning · Statistics 2025-11-06 Alexander J. Gates

Given the increasing popularity of algorithms for overlapping clustering, in particular in social network analysis, quantitative measures are needed to measure the accuracy of a method. Given a set of true clusters, and the set of clusters…

Physics and Society · Physics 2013-08-05 Aaron F. McDaid , Derek Greene , Neil Hurley

Normalized mutual information is widely used as a similarity measure for evaluating the performance of clustering and classification algorithms. In this paper, we argue that results returned by the normalized mutual information are biased…

Social and Information Networks · Computer Science 2025-12-23 Maximilian Jerdee , Alec Kirkley , M. E. J. Newman

Deep neural networks can memorize corrupted labels, making data quality critical for model performance, yet real-world datasets are frequently compromised by both label noise and input noise. This paper proposes a mutual information-based…

Machine Learning · Computer Science 2025-08-12 Jinghan Yang , Jiayu Weng

Clustering is at the very core of machine learning, and its applications proliferate with the increasing availability of data. However, as datasets grow, comparing clusterings with an adjustment for chance becomes computationally difficult,…

Machine Learning · Computer Science 2023-08-01 Kai Klede , Leo Schwinn , Dario Zanca , Björn Eskofier

In this paper we propose an active metric learning method for clustering with pairwise constraints. The proposed method actively queries the label of informative instance pairs, while estimating underlying metrics by incorporating unlabeled…

Machine Learning · Statistics 2021-11-10 Yujia Deng , Yubai Yuan , Haoda Fu , Annie Qu

Adjusted for chance measures are widely used to compare partitions/clusterings of the same data set. In particular, the Adjusted Rand Index (ARI) based on pair-counting, and the Adjusted Mutual Information (AMI) based on Shannon information…

Machine Learning · Statistics 2015-12-07 Simone Romano , Nguyen Xuan Vinh , James Bailey , Karin Verspoor

After a clustering solution is generated automatically, labelling these clusters becomes important to help understanding the results. In this paper, we propose to use a Mutual Information based method to label clusters of journal articles.…

Information Retrieval · Computer Science 2017-02-28 Rob Koopman , Shenghui Wang

The use of mutual information as a similarity measure in agglomerative hierarchical clustering (AHC) raises an important issue: some correction needs to be applied for the dimensionality of variables. In this work, we formulate the decision…

Machine Learning · Statistics 2016-08-07 Guillaume Marrelec , Arnaud Messé , Pierre Bellec

Adjusted similarity measures, such as Cohen's kappa for inter-rater reliability and the adjusted Rand index used to compare clustering algorithms, are a vital tool for comparing discrete labellings. These measures are intended to have the…

Methodology · Statistics 2026-01-16 William L. Lippitt , Edward J. Bedrick , Nichole E. Carlson

A recent article proposed reduced mutual information for evaluation of clustering, classification and community detection. The motivation is that the standard normalized mutual information (NMI) may give counter-intuitive answers under…

Social and Information Networks · Computer Science 2020-05-15 Zhong-Yuan Zhang

Estimating mutual information from observed samples is a basic primitive, useful in several machine learning tasks including correlation mining, information bottleneck clustering, learning a Chow-Liu tree, and conditional independence…

Information Theory · Computer Science 2018-10-11 Weihao Gao , Sreeram Kannan , Sewoong Oh , Pramod Viswanath

Distance metric learning algorithms aim to appropriately measure similarities and distances between data points. In the context of clustering, metric learning is typically applied with the assist of side-information provided by experts,…

Machine Learning · Computer Science 2021-05-27 Rodrigo Randel , Daniel Aloise , Alain Hertz

Mutual Information is the metric that is used to perform link adaptation, which allows to achieve rates near capacity. The computation of adaptive transmission modes is achieved by employing the mapping between the Signal to Noise Ratio and…

Signal Processing · Electrical Eng. & Systems 2018-07-26 Pol Henarejos , Ana Pérez-Neira , Anxo Tato , Carlos Mosquera

A measure of distance between two clusterings has important applications, including clustering validation and ensemble clustering. Generally, such distance measure provides navigation through the space of possible clusterings. Mostly used…

Social and Information Networks · Computer Science 2015-09-01 Reihaneh Rabbany , Osmar R. Zaïane

In this paper, we provide an approach to clustering relational matrices whose entries correspond to either similarities or dissimilarities between objects. Our approach is based on the value of information, a parameterized,…

Artificial Intelligence · Computer Science 2017-10-31 Isaac J. Sledge , Jose C. Principe

Mutual information (MI) is a useful information-theoretic measure to quantify the statistical dependence between two random variables: $X$ and $Y$. Often, we are interested in understanding how the dependence between $X$ and $Y$ in one set…

Information Theory · Computer Science 2025-07-22 Chetan Gohil , Oliver M Cliff , James M. Shine , Ben D. Fulcher , Joseph T. Lizier

To alleviate the data requirement for training effective binary classifiers in binary classification, many weakly supervised learning settings have been proposed. Among them, some consider using pairwise but not pointwise labels, when…

Machine Learning · Computer Science 2022-01-14 Lei Feng , Senlin Shu , Nan Lu , Bo Han , Miao Xu , Gang Niu , Bo An , Masashi Sugiyama
‹ Prev 1 2 3 10 Next ›