Related papers: Transformers can do Bayesian Clustering

Transformers Can Do Bayesian Inference

Currently, it is hard to reap the benefits of deep learning for Bayesian methods, which allow the explicit specification of prior knowledge and accurately capture model uncertainty. We present Prior-Data Fitted Networks (PFNs). PFNs…

Machine Learning · Computer Science 2024-08-14 Samuel Müller , Noah Hollmann , Sebastian Pineda Arango , Josif Grabocka , Frank Hutter

TabClustPFN: A Prior-Fitted Network for Tabular Data Clustering

Clustering tabular data is a fundamental yet challenging problem due to heterogeneous feature types, diverse data-generating mechanisms, and the absence of transferable inductive biases across datasets. Prior-fitted networks (PFNs) have…

Machine Learning · Computer Science 2026-05-15 Tianqi Zhao , Guanyang Wang , Yan Shuo Tan , Qiong Zhang

Clustering by Attention: Leveraging Prior Fitted Transformers for Data Partitioning

Clustering is a core task in machine learning with wide-ranging applications in data mining and pattern recognition. However, its unsupervised nature makes it inherently challenging. Many existing clustering algorithms suffer from critical…

Machine Learning · Computer Science 2025-07-29 Ahmed Shokry , Ayman Khalafallah

Distribution Transformers: Fast Approximate Bayesian Inference With On-The-Fly Prior Adaptation

While Bayesian inference provides a principled framework for reasoning under uncertainty, its widespread adoption is limited by the intractability of exact posterior computation, necessitating the use of approximate inference. However,…

Machine Learning · Statistics 2026-05-19 George Whittle , Juliusz Ziomek , Jacob Rawling , Maike A. Osborne

Transformers Can Learn Posterior Predictive Distributions In-Context

Prior-data fitted networks (PFNs) have recently emerged as a powerful approach for Bayesian prediction tasks, approximating the posterior predictive distribution (PPD) through in-context learning. Despite their strong empirical performance…

Machine Learning · Statistics 2026-05-27 Gyeonghun Kang , Changwoo J. Lee , Xiang Cheng

Bayesian Level Set Clustering

Classically, Bayesian clustering interprets each component of a mixture model as a cluster. The inferred clustering posterior is highly sensitive to any inaccuracies in the kernel within each component. As this kernel is made more flexible,…

Methodology · Statistics 2025-12-12 David Buch , Miheer Dewaskar , David B. Dunson

Advances in Bayesian random partition models: A comprehensive review

Clustering is a crucial task in various domains of knowledge, including medicine, epidemiology, genomics, environmental science, economics, and visual sciences, among others. Methodologies for inferring the number of clusters have often…

Methodology · Statistics 2025-05-26 Clara Grazian

Position: The Future of Bayesian Prediction Is Prior-Fitted

Training neural networks on randomly generated artificial datasets yields Bayesian models that capture the prior defined by the dataset-generating distribution. Prior-data Fitted Networks (PFNs) are a class of methods designed to leverage…

Machine Learning · Computer Science 2025-06-02 Samuel Müller , Arik Reuter , Noah Hollmann , David Rügamer , Frank Hutter

Uncertainty Quantification for Prior-Data Fitted Networks using Martingale Posteriors

Prior-data fitted networks (PFNs) have emerged as promising foundation models for prediction from tabular datasets, achieving state-of-the-art performance on small to moderate data sizes without tuning. While PFNs are motivated by Bayesian…

Methodology · Statistics 2026-05-11 Thomas Nagler , David Rügamer

Bayesian Clustering via Fusing of Localized Densities

Bayesian clustering typically relies on mixture models, with each component interpreted as a different cluster. After defining a prior for the component parameters and weights, Markov chain Monte Carlo (MCMC) algorithms are commonly used to…

Methodology · Statistics 2024-07-30 Alexander Dombowsky , David B. Dunson

Efficient Bayesian Learning Curve Extrapolation using Prior-Data Fitted Networks

Learning curve extrapolation aims to predict model performance in later epochs of training, based on the performance in earlier epochs. In this work, we argue that, while the inherent uncertainty in the extrapolation of learning curves…

Machine Learning · Computer Science 2023-11-01 Steven Adriaensen , Herilalaina Rakotoarison , Samuel Müller , Frank Hutter

Bayesian Finite Mixture Models

Finite mixture models are a useful statistical model class for clustering and density approximation. In the Bayesian framework finite mixture models require the specification of suitable priors in addition to the data model. These priors…

Methodology · Statistics 2024-07-09 Bettina Grün , Gertraud Malsiner-Walli

Uncertainty Quantification in Bayesian Clustering

Bayesian clustering methods have the widely touted advantage of providing a probabilistic characterization of uncertainty in clustering through the posterior distribution. An amazing variety of priors and likelihoods have been proposed for…

Methodology · Statistics 2025-11-21 Garritt L. Page , Andrés F. Barrientos , David B. Dahl , David B. Dunson

Approximate Inference via Clustering

In recent years, large-scale Bayesian learning draws a great deal of attention. However, in big-data era, the amount of data we face is growing much faster than our ability to deal with it. Fortunately, it is observed that large-scale…

Machine Learning · Computer Science 2022-02-15 Qianqian Song

Fair Bayesian Model-Based Clustering

Fair clustering has become a socially significant task with the advancement of machine learning technologies and the growing demand for trustworthy AI. Group fairness ensures that the proportions of each sensitive group are similar in all…

Machine Learning · Statistics 2025-06-17 Jihu Lee , Kunwoong Kim , Yongdai Kim

Cluster expansion by transfer learning for phase stability predictions

Recent progress towards universal machine-learned interatomic potentials holds considerable promise for materials discovery. Yet the accuracy of these potentials for predicting phase stability may still be limited. In contrast, cluster…

Materials Science · Physics 2024-05-03 A. Dana , L. Mu , S. Gelin , S. B. Sinnott , I. Dabo

Machine Learning and the Future of Bayesian Computation

Bayesian models are a powerful tool for studying complex data, allowing the analyst to encode rich hierarchical dependencies and leverage prior information. Most importantly, they facilitate a complete characterization of uncertainty…

Machine Learning · Statistics 2023-04-25 Steven Winter , Trevor Campbell , Lizhen Lin , Sanvesh Srivastava , David B. Dunson

Prior-Fitted Networks Scale to Larger Datasets When Treated as Weak Learners

Prior-Fitted Networks (PFNs) have recently been proposed to efficiently perform tabular classification tasks. Although they achieve good performance on small datasets, they encounter limitations with larger datasets. These limitations…

Machine Learning · Computer Science 2025-03-04 Yuxin Wang , Botian Jiang , Yiran Guo , Quan Gan , David Wipf , Xuanjing Huang , Xipeng Qiu

Clustering Multivariate Data using Factor Analytic Bayesian Mixtures with an Unknown Number of Components

Recent work on overfitting Bayesian mixtures of distributions offers a powerful framework for clustering multivariate data using a latent Gaussian model which resembles the factor analysis model. The flexibility provided by overfitting…

Methodology · Statistics 2019-08-29 Panagiotis Papastamoulis

Consistency of Graphical Model-based Clustering: Robust Clustering using Bayesian Spanning Forest

Mixture model-based frameworks are very popular for statistical inference in clustering. While convenient for producing probabilistic estimates of cluster assignments and uncertainty, they are prone to misspecification, which can lead to…

Statistics Theory · Mathematics 2026-05-15 Yu Zheng , Leo L. Duan , Arkaprava Roy