Related papers: M-decomposability, elliptical unimodal densities, …

Robust M-Estimation Based Bayesian Cluster Enumeration for Real Elliptically Symmetric Distributions

Robustly determining the optimal number of clusters in a data set is an essential factor in a wide range of applications. Cluster enumeration becomes challenging when the true underlying structure in the observed data is corrupted by…

Signal Processing · Electrical Eng. & Systems 2021-05-06 Christian A. Schroth, Michael Muma

A new interpoint distance-based clustering algorithm using kernel density estimation

A novel nonparametric clustering algorithm is proposed using the interpoint distances between the members of the data to reveal the inherent clustering structure existing in the given set of data, where we apply the classical nonparametric…

Methodology · Statistics 2024-09-02 Soumita Modak

A Note on Optimizing Distributions using Kernel Mean Embeddings

Kernel mean embeddings are a popular tool that consists in representing probability measures by their infinite-dimensional mean embeddings in a reproducing kernel Hilbert space. When the kernel is characteristic, mean embeddings can be used…

Machine Learning · Computer Science 2021-06-29 Boris Muzellec, Francis Bach, Alessandro Rudi

Model-based clustering of multivariate binary data with dimension reduction

Clustering methods with dimension reduction have been receiving considerable interest in statistics lately and a lot of methods to simultaneously perform clustering and dimension reduction have been proposed. This work presents a novel…

Methodology · Statistics 2014-06-17 Michio Yamamoto, Kenichi Hayashi

Density Estimation on Rectifiable Sets

Kernel density estimation is a popular method for estimating unseen probability distributions. However, the convergence of these classical estimators to the true density slows down in high dimensions. Moreover, they do not define meaningful…

Statistics Theory · Mathematics 2025-05-30 Jack Kendrick

m-irreducible numerical semigroups

In this paper we introduce the notion of m-irreducibility that extends the standard concept of irreducibility of a numerical semigroup when the multiplicity is fixed. We analyze the structure of the set of m-irreducible numerical…

Commutative Algebra · Mathematics 2010-06-18 V. Blanco, J. C. Rosales

Hierarchical orbital decompositions and extended decomposable distributions

Elliptically contoured distributions can be considered to be the distributions for which the contours of the density functions are proportional ellipsoids. Kamiya, Takemura and Kuriki (2006) generalized the elliptically contoured…

Statistics Theory · Mathematics 2008-01-27 Hidehiko Kamiya, Akimichi Takemura

Uniform Transformation of Non-Separable Probability Distributions

A theoretical framework is developed to describe the transformation that distributes probability density functions uniformly over space. In one dimension, the cumulative distribution can be used, but does not generalize to higher…

Neural and Evolutionary Computing · Computer Science 2016-09-08 Eric Kee

Unsupervised Learning Under a General Semiparametric Clusterwise Elliptical Distribution: Efficient Estimation, Optimal Clustering, and Consistent Cluster Selection

We introduce a general semiparametric clusterwise elliptical distribution to assess how latent cluster structure shapes continuous outcomes. Using a subjectwise representation, we first estimate cluster-specific mean vectors and a…

Methodology · Statistics 2026-04-10 Jen-Chieh Teng, Sheng-Hsin Fan, Chin-Tsang Chiang, Ming-Yueh Huang, Alvin Lim

Directional density-based clustering

Density-based clustering methodology has been widely considered in the statistical literature for classifying Euclidean observations. However, this approach has not been contemplated for directional data yet. In this work, directional…

Methodology · Statistics 2023-03-07 Paula Saavedra-Nieves, Martín Fernández-Pérez

Modal clustering of matrix-variate data

The nonparametric formulation of density-based clustering, known as modal clustering, draws a correspondence between groups and the attraction domains of the modes of the density function underlying the data. Its probabilistic foundation…

Methodology · Statistics 2020-10-27 Federico Ferraccioli, Giovanna Menardi

Clustering with Potential Multidimensionality: Inference and Practice

We show how clustering standard errors in one or more dimensions can be justified in M-estimation when there is sampling or assignment uncertainty. Since existing procedures for variance estimation are either conservative or invalid, we…

Econometrics · Economics 2024-11-21 Ruonan Xu, Luther Yap

Denoising diffusion probabilistic models are optimally adaptive to unknown low dimensionality

The denoising diffusion probabilistic model (DDPM) has emerged as a mainstream generative model in generative AI. While sharp convergence guarantees have been established for the DDPM, the iteration complexity is, in general, proportional…

Machine Learning · Computer Science 2026-02-17 Zhihan Huang, Yuting Wei, Yuxin Chen

Smoothing Spline Semiparametric Density Models

Density estimation plays a fundamental role in many areas of statistics and machine learning. Parametric, nonparametric and semiparametric density estimation methods have been proposed in the literature. Semiparametric density models are…

Statistics Theory · Mathematics 2019-01-11 Jian Shi, Jiahui Yu, Anna Liu, Yuedong Wang

A new measure for assessment of clustering based on kernel density estimation

A new clustering accuracy measure is proposed to determine the unknown number of clusters and to assess the quality of clustering of a data set given in any dimensional space. Our validity index applies the classical nonparametric…

Methodology · Statistics 2022-02-15 Soumita Modak

The cluster decomposition of the configurational energy of multicomponent alloys

Lattice models parameterized using first-principles calculations constitute an effective framework to simulate the thermodynamic behavior of physical systems. The cluster expansion method is a flexible lattice-based method used extensively…

Materials Science · Physics 2023-01-09 Luis Barroso-Luque , Gerbrand Ceder

A mixture of ellipsoidal densities for 3D data modelling

In this paper, we propose a new ellipsoidal mixture model. This model is based on a new probability density function belonging to the family of elliptical distributions and designed to model points spread around an ellipsoidal surface. Then,…

Methodology · Statistics 2023-09-22 Denis Brazey, Antoine Godichon-Baggioni, Bruno Portier

Differentiability of M-functionals of location and scatter based on t likelihoods

The paper aims at finding widely and smoothly defined nonparametric location and scatter functionals. As a convenient vehicle, maximum likelihood estimation of the location vector m and scatter matrix S of an elliptically symmetric t…

Statistics Theory · Mathematics 2009-03-20 R. M. Dudley, Sergiy Sidenko, Zuoqin Wang

Uniform Convergence Rate of the Kernel Density Estimator Adaptive to Intrinsic Volume Dimension

We derive concentration inequalities for the supremum norm of the difference between a kernel density estimator (KDE) and its point-wise expectation that hold uniformly over the selection of the bandwidth and under weaker conditions on the…

Statistics Theory · Mathematics 2020-01-01 Jisu Kim, Jaehyeok Shin, Alessandro Rinaldo, Larry Wasserman

A Probabilistic $\ell_1$ Method for Clustering High Dimensional Data

In general, the clustering problem is NP-hard, and global optimality cannot be established for non-trivial instances. For high-dimensional data, distance-based methods for clustering or classification face an additional difficulty, the…

Statistics Theory · Mathematics 2016-04-26 Tsvetan Asamov, Adi Ben-Israel