Related papers: Visualization tools for parameter selection in clu…

Evaluating network partitions through visualization

Network clustering requires making many decisions manually, such as the number of groups and a statistical model to be used. Even after filtering using an information criterion or regularizing with a nonparametric framework, we are commonly…

Social and Information Networks · Computer Science 2019-06-05 Chihiro Noguchi , Tatsuro Kawamoto

Clustering and visualization tools to study high dimensional parameter spaces: B anomalies example

We describe the applications of clustering and visualization tools using the so-called neutral B anomalies as an example. Clustering permits parameter space partitioning into regions that can be separated with some given measurements. It…

Data Analysis, Statistics and Probability · Physics 2023-04-04 Ursula Laa , German Valencia

A Serial Multilevel Hypergraph Partitioning Algorithm

The graph partitioning problem has many applications in scientific computing such as computer aided design, data mining, image compression and other applications with sparse-matrix vector multiplications as a kernel operation. In many cases…

Data Structures and Algorithms · Computer Science 2016-01-08 Foad Lotfifar , Matthew Johnson

Visualizing Overlapping Biclusterings and Boolean Matrix Factorizations

Finding (bi-)clusters in bipartite graphs is a popular data analysis approach. Analysts typically want to visualize the clusters, which is simple as long as the clusters are disjoint. However, many modern algorithms find overlapping…

Machine Learning · Computer Science 2023-07-17 Thibault Marette , Pauli Miettinen , Stefan Neumann

Clustering by Constructing Hyper-Planes

As a kind of basic machine learning method, clustering algorithms group data points into different categories based on their similarity or distribution. We present a clustering algorithm by finding hyper-planes to distinguish the data…

Computer Vision and Pattern Recognition · Computer Science 2020-04-28 Luhong Diao , Jinying Gao1 , Manman Deng

KPIs-Based Clustering and Visualization of HPC jobs: a Feature Reduction Approach

High-Performance Computing (HPC) systems need to be constantly monitored to ensure their stability. The monitoring systems collect a tremendous amount of data about different parameters or Key Performance Indicators (KPIs), such as resource…

Artificial Intelligence · Computer Science 2023-12-12 Mohamed Soliman Halawa , Rebeca P. Díaz-Redondo , Ana Fernández-Vilas

Combining clustering of variables and feature selection using random forests

Standard approaches to tackle high-dimensional supervised classification problem often include variable selection and dimension reduction procedures. The novel methodology proposed in this paper combines clustering of variables and feature…

Statistics Theory · Mathematics 2018-11-07 Marie Chavent , Robin Genuer , Jerome Saracco

Neural Clustering based Visual Representation Learning

We investigate a fundamental aspect of machine vision: the measurement of features, by revisiting clustering, one of the most classic approaches in machine learning and data analysis. Existing visual feature extractors, including ConvNets,…

Computer Vision and Pattern Recognition · Computer Science 2024-03-27 Guikun Chen , Xia Li , Yi Yang , Wenguan Wang

StruClus: Structural Clustering of Large-Scale Graph Databases

We present a structural clustering algorithm for large-scale datasets of small labeled graphs, utilizing a frequent subgraph sampling strategy. A set of representatives provides an intuitive description of each cluster, supports the…

Databases · Computer Science 2016-10-03 Till Schäfer , Petra Mutzel

Ultrametric Cluster Hierarchies: I Want 'em All!

Hierarchical clustering is a powerful tool for exploratory data analysis, organizing data into a tree of clusterings from which a partition can be chosen. This paper generalizes these ideas by proving that, for any reasonable hierarchy, one…

Machine Learning · Computer Science 2025-11-13 Andrew Draganov , Pascal Weber , Rasmus Skibdahl Melanchton Jørgensen , Anna Beer , Claudia Plant , Ira Assent

HERCULES: Hierarchical Embedding-based Recursive Clustering Using LLMs for Efficient Summarization

The explosive growth of complex datasets across various modalities necessitates advanced analytical tools that not only group data effectively but also provide human-understandable insights into the discovered structures. We introduce…

Machine Learning · Computer Science 2025-09-04 Gabor Petnehazi , Bernadett Aradi

Feature Screening in Large Scale Cluster Analysis

We propose a novel methodology for feature screening in clustering massive datasets, in which both the number of features and the number of observations can potentially be very large. Taking advantage of a fusion penalization based convex…

Methodology · Statistics 2017-10-05 Trambak Banerjee , Gourab Mukherjee , Peter Radchenko

Parameterized Correlation Clustering in Hypergraphs and Bipartite Graphs

Motivated by applications in community detection and dense subgraph discovery, we consider new clustering objectives in hypergraphs and bipartite graphs. These objectives are parameterized by one or more resolution parameters in order to…

Data Structures and Algorithms · Computer Science 2020-06-22 Nate Veldt , Anthony Wirth , David F. Gleich

Cluster Identification and Characterization of Physical Fields

The description of complex configuration is a difficult issue. We present a powerful technique for cluster identification and characterization. The scheme is designed to treat with and analyze the experimental and/or simulation data from…

Statistical Mechanics · Physics 2013-08-29 Guangcai Zhang , Aiguo Xu , Guo Lu , Zeyao Mo

Model-Based Hierarchical Clustering

We present an approach to model-based hierarchical clustering by formulating an objective function based on a Bayesian analysis. This model organizes the data into a cluster hierarchy while specifying a complex feature-set partitioning that…

Machine Learning · Computer Science 2013-01-18 Shivakumar Vaithyanathan , Byron E Dom

Realtime Hierarchical Clustering based on Boundary and Surface Statistics

Visual grouping is a key mechanism in human scene perception. There, it belongs to the subconscious, early processing and is key prerequisite for other high level tasks such as recognition. In this paper, we introduce an efficient, realtime…

Computer Vision and Pattern Recognition · Computer Science 2016-09-23 Dominik Alexander Klein , Dirk Schulz , Armin Bernd Cremers

Clustering with fair-center representation: parameterized approximation algorithms and heuristics

We study a variant of classical clustering formulations in the context of algorithmic fairness, known as diversity-aware clustering. In this variant we are given a collection of facility subsets, and a solution must contain at least a…

Data Structures and Algorithms · Computer Science 2022-10-25 Suhas Thejaswi , Ameet Gadekar , Bruno Ordozgoiti , Michal Osadnik

Clustrophile: A Tool for Visual Clustering Analysis

While clustering is one of the most popular methods for data mining, analysts lack adequate tools for quick, iterative clustering analysis, which is essential for hypothesis generation and data reasoning. We introduce Clustrophile, an…

Human-Computer Interaction · Computer Science 2017-10-09 Çağatay Demiralp

Hierarchical clustering: visualization, feature importance and model selection

We propose methods for the analysis of hierarchical clustering that fully use the multi-resolution structure provided by a dendrogram. Specifically, we propose a loss for choosing between clustering methods, a feature importance score and a…

Methodology · Statistics 2023-01-31 Luben M. C. Cabezas , Rafael Izbicki , Rafael B. Stern

A graphical heuristic for reduction and partitioning of large datasets for scalable supervised training

A scalable graphical method is presented for selecting, and partitioning datasets for the training phase of a classification task. For the heuristic, a clustering algorithm is required to get its computation cost in a reasonable proportion…

Machine Learning · Computer Science 2019-07-25 Sumedh Yadav , Mathis Bode