Related papers: State-Space Dynamics Distance for Clustering Seque…

Fusion Subspace Clustering for Incomplete Data

This paper introduces fusion subspace clustering, a novel method to learn low-dimensional structures that approximate large scale yet highly incomplete data. The main idea is to assign each datum to a subspace of its own, and minimize…

Machine Learning · Computer Science 2022-05-24 Usman Mahmood, Daniel Pimentel-Alarcón

An expressive dissimilarity measure for relational clustering using neighbourhood trees

Clustering is an underspecified task: there are no universal criteria for what makes a good clustering. This is especially true for relational data, where similarity can be based on the features of individuals, the relationships between…

Machine Learning · Statistics 2017-09-29 Sebastijan Dumancic, Hendrik Blockeel

Clustering by latent dimensions

This paper introduces a new clustering technique, called dimensional clustering, which clusters each data point by its latent pointwise dimension, which is a measure of the dimensionality of the data set local to that point.…

Machine Learning · Statistics 2018-05-29 Shohei Hidaka, Neeraj Kashyap

Clustering Time-Series by a Novel Slope-Based Similarity Measure Considering Particle Swarm Optimization

Recently there has been an increase in the studies on time-series data mining specifically time-series clustering due to the vast existence of time-series in various domains. The large volume of data in the form of time-series makes it…

Machine Learning · Computer Science 2019-12-06 Hossein Kamalzadeh, Abbas Ahmadi, Saeed Mansour

Review and Perspective for Distance Based Trajectory Clustering

In this paper we tackle the issue of clustering trajectories of geolocalized observations. Using clustering techniques based on the choice of a distance between the observations, we first provide a comprehensive review of the different…

Machine Learning · Statistics 2015-08-21 Philippe Besse, Brendan Guillouet, Jean-Michel Loubes, Royer François

Clustering is spotting patterns in a group of objects and, as a result, grouping similar objects together. Objects have attributes which are not always numerical; sometimes attributes have domains or categories to which they could belong…

Machine Learning · Computer Science 2020-11-20 Utkarsh Nath, Shikha Asrani, Rahul Katarya

A New Homogeneity Inter-Clusters Measure in SemiSupervised Clustering

Many studies in data mining have proposed a new type of learning called semi-supervised learning. This type of learning combines unlabeled and labeled data which are hard to obtain. However, in unsupervised methods, only unlabeled data are used. The…

Machine Learning · Computer Science 2013-04-16 Badreddine Meftahi, Ourida Ben Boubaker Saidi

Context Steering: A New Paradigm for Compression-based Embeddings by Synthesizing Relevant Information Features

Compression-based dissimilarities (CD) offer a flexible and domain-agnostic means of measuring similarity by identifying implicit information through redundancies between data objects. However, as similarity features are derived from the…

Machine Learning · Computer Science 2026-05-13 Guillermo Sarasa, Ana Granados, Francisco de Borja Rodríguez

Distance for Functional Data Clustering Based on Smoothing Parameter Commutation

We propose a novel method to determine the dissimilarity between subjects for functional data clustering. Spline smoothing or interpolation is common to deal with data of such type. Instead of estimating the best-representing curve for each…

Methodology · Statistics 2021-03-23 ShengLi Tzeng, Christian Hennig, Yu-Fen Li, Chien-Ju Lin

Cluster-state quantum computation

This article is a short introduction to and review of the cluster-state model of quantum computation, in which coherent quantum information processing is accomplished via a sequence of single-qubit measurements applied to a fixed quantum…

Quantum Physics · Physics 2009-11-11 Michael A. Nielsen

Quantifying Distances Between Clusters with Elliptical or Non-Elliptical Shapes

Finite mixture models that allow for a broad range of potentially non-elliptical cluster distributions is an emerging methodological field. Such methods allow for the shape of the clusters to match the natural heterogeneity of the data,…

Methodology · Statistics 2022-06-24 Meredith L. Wallace, Lisa McTeague, Jessica L. Graves, Nicholas Kissel, Cristina Tortora, Bradley Wheeler, Satish Iyengar

The Exploitation of Distance Distributions for Clustering

Although distance measures are used in many machine learning algorithms, the literature on the context-independent selection and evaluation of distance measures is limited in the sense that prior knowledge is used. In cluster analysis,…

Machine Learning · Computer Science 2021-08-24 Michael C. Thrun

Estimating Sequence Similarity from Read Sets for Clustering Next-Generation Sequencing data

To cluster sequences given only their read-set representations, one may try to reconstruct each one from the corresponding read set, and then employ conventional (dis)similarity measures such as the edit distance on the assembled sequences.…

Data Structures and Algorithms · Computer Science 2017-05-18 Petr Ryšavý, Filip Železný

From Time Series to Euclidean Spaces: On Spatial Transformations for Temporal Clustering

Unsupervised clustering of temporal data is both challenging and crucial in machine learning. In this paper, we show that neither traditional clustering methods, time series specific or even deep learning-based alternatives generalise well…

Machine Learning · Computer Science 2020-10-13 Nuno Mota Goncalves, Ioana Giurgiu, Anika Schumann

Categorical Data Clustering via Value Order Estimated Distance Metric Learning

Clustering is a popular machine learning technique for data mining that can process and analyze datasets to automatically reveal sample distribution patterns. Since the ubiquitous categorical data naturally lack a well-defined metric space…

Machine Learning · Computer Science 2025-09-01 Yiqun Zhang, Mingjie Zhao, Hong Jia, Yang Lu, Mengke Li, Yiu-ming Cheung

CADM: Cluster-customized Adaptive Distance Metric for Categorical Data Clustering

An appropriate distance metric is crucial for categorical data clustering, as the distance between categorical data cannot be directly calculated. However, the distances between attribute values usually vary in different clusters induced by…

Machine Learning · Computer Science 2026-03-09 Taixi Chen, Yiu-ming Cheung, Yiqun Zhang

This paper introduces a new unsupervised method for the clustering of physiological data into health states based on their similarity. We propose an iterative hierarchical clustering approach that combines health states according to a…

Signal Processing · Electrical Eng. & Systems 2018-03-28 Fabian Schrumpf, Gerold Bausch, Matthias Sturm, Mirco Fuchs

Bayesian Distance Clustering

Model-based clustering is widely-used in a variety of application areas. However, fundamental concerns remain about robustness. In particular, results can be sensitive to the choice of kernel representing the within-cluster data density.…

Machine Learning · Statistics 2019-06-27 Leo L Duan, David B Dunson

Fusion Subspace Clustering: Full and Incomplete Data

Modern inference and learning often hinge on identifying low-dimensional structures that approximate large scale data. Subspace clustering achieves this through a union of linear subspaces. However, in contemporary applications data is…

Machine Learning · Computer Science 2018-08-03 Daniel L. Pimentel-Alarcón, Usman Mahmood

Generalization of Clustering Agreements and Distances for Overlapping Clusters and Network Communities

A measure of distance between two clusterings has important applications, including clustering validation and ensemble clustering. Generally, such distance measure provides navigation through the space of possible clusterings. Mostly used…

Social and Information Networks · Computer Science 2015-09-01 Reihaneh Rabbany, Osmar R. Zaïane