Related papers: On $p$-adic Classification

Algorithmic local monomialization of a binomial: a comparison of different approaches

We investigate different approaches to transform a given binomial into a monomial via blowing up appropriate centers. In particular, we develop explicit implementations in Singular, which allow to make a comparison on the basis of…

Algebraic Geometry · Mathematics 2022-10-05 Sabrina Alexandra Gaube, Bernd Schober

Clustering by Constructing Hyper-Planes

As a kind of basic machine learning method, clustering algorithms group data points into different categories based on their similarity or distribution. We present a clustering algorithm by finding hyper-planes to distinguish the data…

Computer Vision and Pattern Recognition · Computer Science 2020-04-28 Luhong Diao, Jinying Gao, Manman Deng

A LASSO-Penalized BIC for Mixture Model Selection

The efficacy of family-based approaches to mixture model-based clustering and classification depends on the selection of parsimonious models. Current wisdom suggests the Bayesian information criterion (BIC) for mixture model selection.…

Methodology · Statistics 2013-11-12 Sakyajit Bhattacharya, Paul D. McNicholas

XClusters: Explainability-first Clustering

We study the problem of explainability-first clustering where explainability becomes a first-class citizen for clustering. Previous clustering approaches use decision trees for explanation, but only after the clustering is completed. In…

Machine Learning · Computer Science 2022-12-13 Hyunseung Hwang, Steven Euijong Whang

Differentiable Clustering with Perturbed Spanning Forests

We introduce a differentiable clustering method based on stochastic perturbations of minimum-weight spanning forests. This allows us to include clustering in end-to-end trainable pipelines, with efficient gradients. We show that our method…

Machine Learning · Computer Science 2023-11-07 Lawrence Stewart, Francis S Bach, Felipe Llinares López, Quentin Berthet

Energy-Based Clustering: Fast and Robust Clustering of Data with Known Likelihood Functions

Clustering has become an indispensable tool in the presence of increasingly large and complex data sets. Most clustering algorithms depend, either explicitly or implicitly, on the sampled density. However, estimated densities are fragile…

Chemical Physics · Physics 2023-08-21 Moritz Thürlemann, Sereina Riniker

Data Clustering via Principal Direction Gap Partitioning

We explore the geometrical interpretation of the PCA based clustering algorithm Principal Direction Divisive Partitioning (PDDP). We give several examples where this algorithm breaks down, and suggest a new method, gap partitioning, which…

Machine Learning · Statistics 2012-11-20 Ralph Abbey, Jeremy Diepenbrock, Amy Langville, Carl Meyer, Shaina Race, Dexin Zhou

Note on $p$-adic Local Functional Equation

Given primes $\ell\ne p$, we record here a $p$-adic valued Fourier theory on a local field over $\mathbf{Q}_\ell$, which is developed under the perspective of group schemes. As an application, by substituting rigid analysis for complex…

Number Theory · Mathematics 2022-06-23 Luochen Zhao

Construction of a set of p-adic distributions

In this paper, adapting to the $p$-adic case some methods of real-valued Gibbs measures on Cayley trees, we construct several $p$-adic distributions on the set $\mathbb{Z}_p$ of $p$-adic integers. Moreover, we give conditions under which these…

Mathematical Physics · Physics 2018-01-17 U. A. Rozikov, Z. T. Tugyonov

Divisive Hierarchical Clustering of Variables Identified by Singular Vectors

In this work, we introduce a novel methodology for divisive hierarchical clustering. Our divisive ("top-down") approach is motivated by the fact that agglomerative hierarchical clustering ("bottom-up"), which is commonly used for…

Methodology · Statistics 2025-10-07 Jan O. Bauer

Fast Clustering of Categorical Big Data

The K-Modes algorithm, developed for clustering categorical data, is of high algorithmic simplicity but suffers from unreliable performances in clustering quality and clustering efficiency, both heavily influenced by the choice of initial…

Machine Learning · Computer Science 2025-02-18 Bipana Thapaliya, Yu Zhuang

A tractable Multi-Partitions Clustering

In the framework of model-based clustering, a model allowing several latent class variables is proposed. This model assumes that the distribution of the observed data can be factorized into several independent blocks of variables. Each…

Methodology · Statistics 2018-01-23 Matthieu Marbac, Vincent Vandewalle

Clustering by Attention: Leveraging Prior Fitted Transformers for Data Partitioning

Clustering is a core task in machine learning with wide-ranging applications in data mining and pattern recognition. However, its unsupervised nature makes it inherently challenging. Many existing clustering algorithms suffer from critical…

Machine Learning · Computer Science 2025-07-29 Ahmed Shokry, Ayman Khalafallah

Compact $p$-adic analytic groups in which centralizers are abelian

Using methods of associative algebras, Lie theory, group cohomology, and modular representation theory, we construct profinite $p$-adic analytic groups such that the centralizer of each of their non-trivial elements is abelian. The paper…

Group Theory · Mathematics 2024-11-07 Luis Mendonça, Thomas S. Weigel, Theo Zapata

On a classification of irreducible admissible modulo $p$ representations of a $p$-adic split reductive group

We give a classification of irreducible admissible modulo $p$ representations of a split $p$-adic reductive group in terms of supersingular representations. This is a generalization of a theorem of Herzig.

Representation Theory · Mathematics 2019-02-20 Noriyuki Abe

Model-based clustering for conditionally correlated categorical data

An extension of the latent class model is presented for clustering categorical data by relaxing the classical "class conditional independence assumption" of variables. This model consists in grouping the variables into inter-independent and…

Computation · Statistics 2015-10-01 Matthieu Marbac, Christophe Biernacki, Vincent Vandewalle

A model selection approach for clustering a multinomial sequence with non-negative factorization

We consider a problem of clustering a sequence of multinomial observations by way of a model selection criterion. We propose a form of a penalty term for the model selection procedure. Our approach subsumes both the conventional AIC and BIC…

Machine Learning · Statistics 2015-08-17 Nam H. Lee, Runze Tang, Carey E. Priebe, Michael Rosen

Clustering by connection center evolution

The determination of cluster centers generally depends on the scale that we use to analyze the data to be clustered. Inappropriate scale usually leads to unreasonable cluster centers and thus unreasonable results. In this study, we first…

Machine Learning · Statistics 2016-10-20 Xiurui Geng, Hairong Tang

Fast evaluation of some p-adic transcendental functions

We design algorithms for computing values of many p-adic elementary and special functions, including logarithms, exponentials, polylogarithms, and hypergeometric functions. All our algorithms feature a quasi-linear complexity with respect…

Symbolic Computation · Computer Science 2021-06-18 Xavier Caruso, Marc Mezzarobba, Nobuki Takayama, Tristan Vaccon

Predictive K-means with local models

Supervised classification can be effective for prediction but sometimes weak on interpretability or explainability (XAI). Clustering, on the other hand, tends to isolate categories or profiles that can be meaningful but there is no…

Machine Learning · Computer Science 2021-04-27 Vincent Lemaire, Oumaima Alaoui Ismaili, Antoine Cornuéjols, Dominique Gay