English
Related papers

Related papers: DeBaCl: A Python Package for Interactive DEnsity-B…

200 papers

We propose methods for the analysis of hierarchical clustering that fully use the multi-resolution structure provided by a dendrogram. Specifically, we propose a loss for choosing between clustering methods, a feature importance score and a…

Methodology · Statistics 2023-01-31 Luben M. C. Cabezas , Rafael Izbicki , Rafael B. Stern

A new strategy is proposed for building easy to interpret predictive models in the context of a high-dimensional dataset, with a large number of highly correlated explanatory variables. The strategy is based on a first step of variables…

Applications · Statistics 2023-07-14 Evelyne Vigneau

Hierarchical clustering is one of the standard methods taught for identifying and exploring the underlying structures that may be present within a data set. Students are shown examples in which the dendrogram, a visual representation of the…

Other Statistics · Statistics 2022-06-06 Andee Kaplan , Jacob Bien

This paper focuses on density-based clustering, particularly the Density Peak (DP) algorithm and the one based on density-connectivity DBSCAN; and proposes a new method which takes advantage of the individual strengths of these two methods…

Machine Learning · Computer Science 2024-01-30 Ye Zhu , Kai Ming Ting , Yuan Jin , Maia Angelova

Clustering is a well-known and studied problem, one of its variants, called contiguity-constrained clustering, accepts as a second input a graph used to encode prior information about cluster structure by means of contiguity constraints…

Computation · Statistics 2023-02-27 Etienne Côme

Previously, we proposed a physically-inspired method to construct data points into an effective in-tree (IT) structure, in which the underlying cluster structure in the dataset is well revealed. Although there are some edges in the IT…

Machine Learning · Statistics 2015-07-30 Teng Qiu , Yongjie Li

Classically, Bayesian clustering interprets each component of a mixture model as a cluster. The inferred clustering posterior is highly sensitive to any inaccuracies in the kernel within each component. As this kernel is made more flexible,…

Methodology · Statistics 2025-12-12 David Buch , Miheer Dewaskar , David B. Dunson

Cluster analysis, or clustering, plays a crucial role across numerous scientific and engineering domains. Despite the wealth of clustering methods proposed over the past decades, each method is typically designed for specific scenarios and…

Methodology · Statistics 2026-01-22 Siyi Wang , Alexandre Leblanc , Paul D. McNicholas

We introduce a cluster evaluation technique called Tree Index. Our Tree Index algorithm aims at describing the structural information of the clustering rather than the quantitative format of cluster-quality indexes (where the representation…

Machine Learning · Computer Science 2020-03-25 A. H. Beg , Md Zahidul Islam , Vladimir Estivill-Castro

How can we find a good graph clustering of a real-world network, that allows insight into its underlying structure and also potential functions? In this paper, we introduce a new graph clustering algorithm Dcut from a density point of view.…

Social and Information Networks · Computer Science 2016-06-06 Junming Shao , Qinli Yang , Jinhu Liu , Stefan Kramer

Density Based Clustering are a type of Clustering methods using in data mining for extracting previously unknown patterns from data sets. There are a number of density based clustering methods such as DBSCAN, OPTICS, DENCLUE, VDBSCAN,…

Machine Learning · Computer Science 2023-07-25 Rupanka Bhuyan , Samarjeet Borah

Hierarchical clustering is an effective, interpretable method for analyzing structure in data. It reveals insights at multiple scales without requiring a predefined number of clusters and captures nested patterns and subtle relationships,…

Finite mixture modelling provides a framework for cluster analysis based on parsimonious Gaussian mixture models. Variable or feature selection is of particular importance in situations where only a subset of the available variables provide…

Computation · Statistics 2014-11-04 Luca Scrucca , Adrian E. Raftery

In this paper, we present a new R package COREclust dedicated to the detection of representative variables in high dimensional spaces with a potentially limited number of observations. Variable sets detection is based on an original graph…

Mathematical Software · Computer Science 2018-05-28 Camille Champion , Anne-Claire Brunet , Jean-Michel Loubes , Laurent Risser

Identifying possible clusters in datasets and estimating their overall modularity are central tasks in pattern recognition. In the present work, concepts and methodologies are described for performing these tasks while considering only the…

Physics and Society · Physics 2026-05-27 Alexandre Benatti , Luciano da F. Costa

"mdendro" is an R package that provides a comprehensive collection of linkage methods for agglomerative hierarchical clustering on a matrix of proximity data (distances or similarities), returning a multifurcated dendrogram or…

Information Retrieval · Computer Science 2025-09-16 Alberto Fernández , Sergio Gómez

partycls is a Python framework for cluster analysis of systems of interacting particles. By grouping particles that share similar structural or dynamical properties, partycls enables rapid and unsupervised exploration of the system's…

Computational Physics · Physics 2021-11-22 Joris Paret , Daniele Coslovich

Techniques for clustering student behaviour offer many opportunities to improve educational outcomes by providing insight into student learning. However, one important aspect of student behaviour, namely its evolution over time, can often…

Machine Learning · Computer Science 2021-10-08 Jessica McBroom , Kalina Yacef , Irena Koprinska

Clustering of longitudinal data is used to explore common trends among subjects over time for a numeric measurement of interest. Various R packages have been introduced throughout the years for identifying clusters of longitudinal patterns,…

Machine Learning · Computer Science 2024-02-23 Niek Den Teuling , Steffen Pauws , Edwin van den Heuvel

When some 'entities' are related by the 'features' they share they are amenable to a bipartite network representation. Plant-pollinator ecological communities, co-authorship of scientific papers, customers and purchases, or answers in a…

Social and Information Networks · Computer Science 2020-10-14 Ignacio Tamarit , María Pereda , José A. Cuesta
‹ Prev 1 2 3 10 Next ›