Related papers: Neural Clustering Processes

Fast Learning of Clusters and Topics via Sparse Posteriors

Mixture models and topic models generate each observation from a single cluster, but standard variational posteriors for each observation assign positive probability to all possible clusters. This requires dense storage and runtime costs…

Machine Learning · Statistics 2017-11-15 Michael C. Hughes , Erik B. Sudderth

Identifying Mixtures of Mixtures Using Bayesian Estimation

The use of a finite mixture of normal distributions in model-based clustering allows to capture non-Gaussian data clusters. However, identifying the clusters from the normal components is challenging and in general either achieved by…

Methodology · Statistics 2016-06-21 Gertraud Malsiner-Walli , Sylvia Frühwirth-Schnatter , Bettina Grün

Model-based Clustering

Mixture models extend the toolbox of clustering methods available to the data analyst. They allow for an explicit definition of the cluster shapes and structure within a probabilistic framework and exploit estimation and inference…

Methodology · Statistics 2025-09-15 Bettina Grün

Learning Neural Models for End-to-End Clustering

We propose a novel end-to-end neural network architecture that, once trained, directly outputs a probabilistic clustering of a batch of input examples in one pass. It estimates a distribution over the number of clusters $k$, and for each $1…

Machine Learning · Computer Science 2018-07-12 Benjamin Bruno Meier , Ismail Elezi , Mohammadreza Amirian , Oliver Durr , Thilo Stadelmann

Probabilistic Partitive Partitioning (PPP)

Clustering is a NP-hard problem. Thus, no optimal algorithm exists, heuristics are applied to cluster the data. Heuristics can be very resource-intensive, if not applied properly. For substantially large data sets computational efficiencies…

Databases · Computer Science 2020-03-11 Mujahid Sultan

Neural Mixture Models with Expectation-Maximization for End-to-end Deep Clustering

Any clustering algorithm must synchronously learn to model the clusters and allocate data to those clusters in the absence of labels. Mixture model-based methods model clusters with pre-defined statistical distributions and allocate data to…

Machine Learning · Computer Science 2022-10-04 Dumindu Tissera , Kasun Vithanage , Rukshan Wijesinghe , Alex Xavier , Sanath Jayasena , Subha Fernando , Ranga Rodrigo

Scalable Clustering: Large Scale Unsupervised Learning of Gaussian Mixture Models with Outliers

Clustering is a widely used technique with a long and rich history in a variety of areas. However, most existing algorithms do not scale well to large datasets, or are missing theoretical guarantees of convergence. This paper introduces a…

Machine Learning · Statistics 2024-10-16 Yijia Zhou , Kyle A. Gallivan , Adrian Barbu

Neural network-based clustering using pairwise constraints

This paper presents a neural network-based end-to-end clustering framework. We design a novel strategy to utilize the contrastive criteria for pushing data-forming clusters directly from raw data, in addition to learning a feature embedding…

Machine Learning · Computer Science 2016-04-27 Yen-Chang Hsu , Zsolt Kira

Without Pain -- Clustering Categorical Data Using a Bayesian Mixture of Finite Mixtures of Latent Class Analysis Models

We propose a Bayesian approach for model-based clustering of multivariate categorical data where variables are allowed to be associated within clusters and the number of clusters is unknown. The approach uses a two-layer mixture of finite…

Methodology · Statistics 2024-07-09 Gertraud Malsiner-Walli , Bettina Grün , Sylvia Frühwirth-Schnatter

Uncertainty Quantification in Bayesian Clustering

Bayesian clustering methods have the widely touted advantage of providing a probabilistic characterization of uncertainty in clustering through the posterior distribution. An amazing variety of priors and likelihoods have been proposed for…

Methodology · Statistics 2025-11-21 Garritt L. Page , Andrés F. Barrientos , David B. Dahl , David B. Dunson

Neural Collaborative Subspace Clustering

We introduce the Neural Collaborative Subspace Clustering, a neural model that discovers clusters of data points drawn from a union of low-dimensional subspaces. In contrast to previous attempts, our model runs without the aid of spectral…

Computer Vision and Pattern Recognition · Computer Science 2019-04-25 Tong Zhang , Pan Ji , Mehrtash Harandi , Wenbing Huang , Hongdong Li

Clustering by Constructing Hyper-Planes

As a kind of basic machine learning method, clustering algorithms group data points into different categories based on their similarity or distribution. We present a clustering algorithm by finding hyper-planes to distinguish the data…

Computer Vision and Pattern Recognition · Computer Science 2020-04-28 Luhong Diao , Jinying Gao1 , Manman Deng

Relabelling Algorithms for Large Dataset Mixture Models

Mixture models are flexible tools in density estimation and classification problems. Bayesian estimation of such models typically relies on sampling from the posterior distribution using Markov chain Monte Carlo. Label switching arises…

Applications · Statistics 2014-03-11 Wanchuang Zhu , Yanan Fan

Model-based clustering of multiple networks with a hierarchical algorithm

The paper tackles the problem of clustering multiple networks, directed or not, that do not share the same set of vertices, into groups of networks with similar topology. A statistical model-based approach based on a finite mixture of…

Statistics Theory · Mathematics 2023-11-07 Tabea Rebafka

Consistency of Graphical Model-based Clustering: Robust Clustering using Bayesian Spanning Forest

Mixture model-based frameworks are very popular for statistical inference in clustering. While convenient for producing probabilistic estimates of cluster assignments and uncertainty, they are prone to misspecification, which can lead to…

Statistics Theory · Mathematics 2026-05-15 Yu Zheng , Leo L. Duan , Arkaprava Roy

Cluster Quilting: Spectral Clustering for Patchwork Learning

Patchwork learning arises as a new and challenging data collection paradigm where both samples and features are observed in fragmented subsets. Due to technological limits, measurement expense, or multimodal data integration, such patchwork…

Methodology · Statistics 2024-06-21 Lili Zheng , Andersen Chang , Genevera I. Allen

Clustering Approaches for Mixed-Type Data: A Comparative Study

Clustering is widely used in unsupervised learning to find homogeneous groups of observations within a dataset. However, clustering mixed-type data remains a challenge, as few existing approaches are suited for this task. This study…

Machine Learning · Statistics 2025-11-26 Badih Ghattas , Alvaro Sanchez San-Benito

A Statistical Approach to Increase Classification Accuracy in Supervised Learning Algorithms

Probabilistic mixture models have been widely used for different machine learning and pattern recognition tasks such as clustering, dimensionality reduction, and classification. In this paper, we focus on trying to solve the most common…

Machine Learning · Computer Science 2020-04-08 Gustavo A Valencia-Zapata , Daniel Mejia , Gerhard Klimeck , Michael Zentner , Okan Ersoy

Clustering of heterogeneous populations of networks

Statistical methods for reconstructing networks from repeated measurements typically assume that all measurements are generated from the same underlying network structure. This need not be the case, however. People's social networks might…

Social and Information Networks · Computer Science 2022-01-25 Jean-Gabriel Young , Alec Kirkley , M. E. J. Newman

Clustering consistency with Dirichlet process mixtures

Dirichlet process mixtures are flexible non-parametric models, particularly suited to density estimation and probabilistic clustering. In this work we study the posterior distribution induced by Dirichlet process mixtures as the sample size…

Statistics Theory · Mathematics 2022-11-29 Filippo Ascolani , Antonio Lijoi , Giovanni Rebaudo , Giacomo Zanella