English
Related papers

Related papers: A Bayesian cluster validity index

200 papers

Nonhierarchical clustering depending on unsupervised algorithms may not retrieve the optimal partition of datasets. Determining if clusters fit ``natural partitions`` can be achieved using cluster validity indices (CVIs). Most existing CVIs…

Methodology · Statistics 2019-06-04 Anri Mutoh , Masamichi Wada , Kou Amano

Relative Validity Indices (RVIs) such as the Silhouette Width Criterion and Davies Bouldin indices are the most widely used tools for evaluating and optimising clustering outcomes. Traditionally, their ability to rank collections of…

There are various cluster validity indices used for evaluating clustering results. One of the main objectives of using these indices is to seek the optimal unknown number of clusters. Some indices work well for clusters with different…

Machine Learning · Statistics 2024-01-09 Nathakhun Wiroonsri

To evaluate clustering results is a significant part of cluster analysis. There are no true class labels for clustering in typical unsupervised learning. Thus, a number of internal evaluations, which use predicted labels and data, have been…

Machine Learning · Computer Science 2021-01-06 Shuyue Guan , Murray Loew

The optimal number of clusters is one of the main concerns when applying cluster analysis. Several cluster validity indexes have been introduced to address this problem. However, in some situations, there is more than one option that can be…

Machine Learning · Statistics 2025-12-24 Nathakhun Wiroonsri , Onthada Preedasawakul

To evaluate clustering results is a significant part of cluster analysis. Since there are no true class labels for clustering in typical unsupervised learning, many internal cluster validity indices (CVIs), which use predicted labels and…

Machine Learning · Computer Science 2021-06-21 Shuyue Guan , Murray Loew

Cluster analysis is used to explore structure in unlabeled data sets in a wide range of applications. An important part of cluster analysis is validating the quality of computationally obtained clusters. A large number of different internal…

Machine Learning · Statistics 2018-01-10 Masud Moshtaghi , James C. Bezdek , Sarah M. Erfani , Christopher Leckie , James Bailey

Validation plays a crucial role in the clustering process. Many different internal validity indexes exist for the purpose of determining the best clustering solution(s) from a given collection of candidates, e.g., as produced by different…

Machine Learning · Statistics 2026-02-23 Connor Simpson , Ricardo J. G. B. Campello , Elizabeth Stojanovski

Many cluster similarity indices are used to evaluate clustering algorithms, and choosing the best one for a particular task remains an open problem. We demonstrate that this problem is crucial: there are many disagreements among the…

Discrete Mathematics · Computer Science 2021-08-27 Martijn Gösgens , Alexey Tikhonov , Liudmila Prokhorenkova

Internal cluster validity measures (such as the Calinski-Harabasz, Dunn, or Davies-Bouldin indices) are frequently used for selecting the appropriate number of partitions a dataset should be split into. In this paper we consider what…

Machine Learning · Statistics 2022-08-31 Marek Gagolewski , Maciej Bartoszuk , Anna Cena

Internal clustering validity indices (ICVIs) assess clustering quality without ground truth labels. Comparative studies consistently find that no single ICVI outperforms others across datasets, leaving practitioners without principled ICVI…

Machine Learning · Computer Science 2025-12-08 Isabella Degen , Zahraa S Abdallah , Kate Robson Brown , Henry W J Reeve

We derive a new Bayesian Information Criterion (BIC) by formulating the problem of estimating the number of clusters in an observed data set as maximization of the posterior probability of the candidate models. Given that some mild…

Statistics Theory · Mathematics 2018-08-28 Freweyni K. Teklehaymanot , Michael Muma , Abdelhak M. Zoubir

With the inclusion of smart meters, electricity load consumption data can be fetched for individual consumer buildings at high temporal resolutions. Availability of such data has made it possible to study daily load demand profiles of the…

Computers and Society · Computer Science 2021-08-04 Mayank Jain , Mukta Jain , Tarek AlSkaif , Soumyabrata Dev

A new cluster validity index is proposed for fuzzy clusters obtained from fuzzy c-means algorithm. The proposed validity index exploits inter-cluster proximity between fuzzy clusters. Inter-cluster proximity is used to measure the degree of…

Artificial Intelligence · Computer Science 2024-07-10 Dae-Won Kim , Kwang H. Lee

A key issue in cluster analysis is the choice of an appropriate clustering method and the determination of the best number of clusters. Different clusterings are optimal on the same data set according to different criteria, and the choice…

Methodology · Statistics 2020-06-24 Serhat Emre Akhanli , Christian Hennig

A major challenge in cluster analysis is that the number of data clusters is mostly unknown and it must be estimated prior to clustering the observed data. In real-world applications, the observed data is often subject to heavy tailed noise…

Machine Learning · Statistics 2020-05-06 Freweyni K. Teklehaymanot , Michael Muma , Abdelhak M. Zoubir

Cluster validity indexes are very important tools designed for two purposes: comparing the performance of clustering algorithms and determining the number of clusters that best fits the data. These indexes are in general constructed by…

Machine Learning · Computer Science 2018-12-24 Ahmed Ben Said , Rachid Hadjidj , Sebti Foufou

Cluster analysis is widely used in the areas of machine learning and data mining. Fuzzy clustering is a particular method that considers that a data point can belong to more than one cluster. Fuzzy clustering helps obtain flexible clusters,…

Machine Learning · Computer Science 2018-06-06 Aybükë Oztürk , Stéphane Lallich , Jérôme Darmont

Validation is one of the most important aspects of clustering, but most approaches have been batch methods. Recently, interest has grown in providing incremental alternatives. This paper extends the incremental cluster validity index (iCVI)…

Machine Learning · Computer Science 2019-02-19 Leonardo Enzo Brito da Silva , Niklas M. Melton , Donald C. Wunsch

Data clustering involves identifying latent similarities within a dataset and organizing them into clusters or groups. The outcomes of various clustering algorithms differ as they are susceptible to the intrinsic characteristics of the…

Machine Learning · Computer Science 2024-07-31 Bryar A. Hassan , Noor Bahjat Tayfor , Alla A. Hassan , Aram M. Ahmed , Tarik A. Rashid , Naz N. Abdalla
‹ Prev 1 2 3 10 Next ›