English
Related papers

Related papers: A new interpoint distance-based clustering algorit…

200 papers

A new clustering accuracy measure is proposed to determine the unknown number of clusters and to assess the quality of clustering of a data set given in any dimensional space. Our validity index applies the classical nonparametric…

Methodology · Statistics 2022-02-15 Soumita Modak

A new interpoint distance-based measure is proposed to identify the optimal number of clusters present in a data set. Designed in nonparametric approach, it is independent of the distribution of given data. Interpoint distances between the…

Machine Learning · Computer Science 2022-10-18 Soumita Modak

This paper proposes a novel, nonparametric, interpoint distance-based measure to investigate whether there exist any groups in a set of given data, and if so then, how many groups are prevailing in total. It is a cluster accuracy index…

Methodology · Statistics 2026-05-21 Soumita Modak

Semi-supervised clustering is the task of clustering data points into clusters where only a fraction of the points are labelled. The true number of clusters in the data is often unknown and most models require this parameter as an input.…

Machine Learning · Computer Science 2013-09-27 Amar Shah , Zoubin Ghahramani

This paper deals with nonparametric estimation of conditional den-sities in mixture models in the case when additional covariates are available. The proposed approach consists of performing a prelim-inary clustering algorithm on the…

Statistics Theory · Mathematics 2015-02-09 Stéphane Auray , Nicolas Klutchnikoff , Laurent Rouvière

Measuring similarity between two objects is the core operation in existing clustering algorithms in grouping similar objects into clusters. This paper introduces a new similarity measure called point-set kernel which computes the similarity…

Machine Learning · Computer Science 2022-01-07 Kai Ming Ting , Jonathan R. Wells , Ye Zhu

An algorithm to improve performance parameter for unsupervised decision forest clustering and density estimation is presented. Specifically, a dual assignment parameter is introduced as a density estimator by combining Random Forest and…

Computer Vision and Pattern Recognition · Computer Science 2015-07-19 Hayder Albehadili , Naz Islam

We derive and analyze a generic, recursive algorithm for estimating all splits in a finite cluster tree as well as the corresponding clusters. We further investigate statistical properties of this generic clustering algorithm when it…

Machine Learning · Statistics 2021-11-02 Ingo Steinwart , Bharath K. Sriperumbudur , Philipp Thomann

We propose a simple and efficient clustering method for high-dimensional data with a large number of clusters. Our algorithm achieves high-performance by evaluating distances of datapoints with a subset of the cluster centres. Our…

Machine Learning · Computer Science 2022-03-30 Georgios Exarchakis , Omar Oubari , Gregor Lenz

Cluster analysis is an unsupervised learning strategy that can be employed to identify subgroups of observations in data sets of unknown structure. This strategy is particularly useful for analyzing high-dimensional data such as microarray…

Methodology · Statistics 2016-10-07 Erika S. Helgeson , Eric Bair

Cluster analysis which focuses on the grouping and categorization of similar elements is widely used in various fields of research. Inspired by the phenomenon of atomic fission, a novel density-based clustering algorithm is proposed in this…

Machine Learning · Computer Science 2020-04-28 Shizhan Lu

This paper introduces a new clustering technique, called {\em dimensional clustering}, which clusters each data point by its latent {\em pointwise dimension}, which is a measure of the dimensionality of the data set local to that point.…

Machine Learning · Statistics 2018-05-29 Shohei Hidaka , Neeraj Kashyap

Clustering is a technique for the analysis of datasets obtained by empirical studies in several disciplines with a major application for biomedical research. Essentially, clustering algorithms are executed by machines aiming at finding…

Quantitative Methods · Quantitative Biology 2024-09-30 Diego Ulisse Pizzagalli , Santiago Fernandez Gonzalez , Rolf Krause

In this paper, we present a novel non-parametric clustering technique. Our technique is based on the notion that each latent cluster is comprised of layers that surround its core, where the external layers, or border points, implicitly…

Computer Vision and Pattern Recognition · Computer Science 2019-07-02 Hadar Averbuch-Elor , Nadav Bar , Daniel Cohen-Or

A novel formulation of the clustering problem is introduced in which the task is expressed as an estimation problem, where the object to be estimated is a function which maps a point to its distribution of cluster membership. Unlike…

Machine Learning · Computer Science 2025-10-14 David P. Hofmeyr

A new depth-based clustering procedure for directional data is proposed. Such method is fully non-parametric and has the advantages to be flexible and applicable even in high dimensions when a suitable notion of depth is adopted. The…

Methodology · Statistics 2022-06-22 Giuseppe Pandolfo , Antonio D'ambrosio

Density-based clustering relies on the idea of linking groups to some specific features of the probability distribution underlying the data. The reference to a true, yet unknown, population structure allows to frame the clustering problem…

Methodology · Statistics 2021-02-01 Alessandro Casa , José E. Chacón , Giovanna Menardi

Density based spatial clustering of points in $\mathbb{R}^n$ has a myriad of applications in a variety of industries. We generalise this problem to the density based clustering of lines in high-dimensional spaces, keeping in mind there…

Machine Learning · Computer Science 2024-10-04 Akanksha Das , Malay Bhattacharyya

This paper introduces a novel nonparametric criterion for determining the appropriate number of clusters, which is derived from the spatial median. The method is constructed to reconcile two competing objectives of cluster analysis: the…

Computation · Statistics 2025-09-26 Hend Gabr , Brian H Willis , Mohammed Baragilly

Density Estimation is one of the central areas of statistics whose purpose is to estimate the probability density function underlying the observed data. It serves as a building block for many tasks in statistical inference, visualization,…

Machine Learning · Statistics 2019-04-02 Zhipeng Wang , David W. Scott
‹ Prev 1 2 3 10 Next ›