English
Related papers

Related papers: Score matching for compositional distributions

200 papers

When observations are truncated, we are limited to an incomplete picture of our dataset. Recent methods propose to use score matching for truncated density estimation, where the access to the intractable normalising constant is not…

Methodology · Statistics 2024-04-15 Daniel J. Williams , Song Liu

Applications such as the analysis of microbiome data have led to renewed interest in statistical methods for compositional data, i.e., multivariate data in the form of probability vectors that contain relative proportions. In particular,…

Methodology · Statistics 2021-09-13 Shiqing Yu , Mathias Drton , Ali Shojaie

One of the major problems for maximum likelihood estimation in the well-established directional models is that the normalising constants can be difficult to evaluate. A new general method of "score matching estimation" is presented here on…

Statistics Theory · Mathematics 2016-04-29 Kanti V Mardia , John T Kent , Arnab K Laha

The restricted polynomially-tilted pairwise interaction (RPPI) distribution gives a flexible model for compositional data. It is particularly well-suited to situations where some of the marginal distributions of the components of a…

Methodology · Statistics 2023-05-15 Janice L. Scealy , Kassel L. Hingee , John T. Kent , Andrew T. A. Wood

Score matching is a vital tool for learning the distribution of data with applications across many areas including diffusion processes, energy based modelling, and graphical model estimation. Despite all these applications, little work…

Machine Learning · Statistics 2025-06-03 Josh Givens , Song Liu , Henry W J Reeve

Many probabilistic models that have an intractable normalizing constant may be extended to contain covariates. Since the evaluation of the exact likelihood is difficult or even impossible for these models, score matching was proposed to…

Statistics Theory · Mathematics 2022-03-21 Jiazhen Xu , Janice L. Scealy , Andrew T. A. Wood , Tao Zou

The Dirichlet-multinomial (DM) distribution plays a fundamental role in modern statistical methodology development and application. Recently, the DM distribution and its variants have been used extensively to model multivariate count data…

Methodology · Statistics 2023-02-27 Matthew D. Koslovsky

We consider the estimation of Dirichlet Process Mixture Models (DPMMs) in distributed environments, where data are distributed across multiple computing nodes. A key advantage of Bayesian nonparametric models such as DPMMs is that they…

Machine Learning · Statistics 2017-09-20 Ruohui Wang , Dahua Lin

Score matching is a recently developed parameter learning method that is particularly effective to complicated high dimensional density models with intractable partition functions. In this paper, we study two issues that have not been…

Machine Learning · Computer Science 2012-05-14 Siwei Lyu

Estimating means on Riemannian manifolds is generally computationally expensive because the Riemannian distance function is not known in closed-form for most manifolds. To overcome this, we show that Riemannian diffusion means can be…

Other Statistics · Statistics 2025-02-19 Frederik Möbius Rygaard , Steen Markvorsen , Søren Hauberg , Stefan Sommer

Diffusion models achieve state-of-the-art performance in various generation tasks. However, their theoretical foundations fall far behind. This paper studies score approximation, estimation, and distribution recovery of diffusion models,…

Machine Learning · Computer Science 2023-02-15 Minshuo Chen , Kaixuan Huang , Tuo Zhao , Mengdi Wang

In microbiome and genomic studies, the regression of compositional data has been a crucial tool for identifying microbial taxa or genes that are associated with clinical phenotypes. To account for the variation in sequencing depth, the…

Methodology · Statistics 2021-03-11 Pixu Shi , Yuchen Zhou , Anru R. Zhang

Score matching is an approach to learning probability distributions parametrized up to a constant of proportionality (e.g. Energy-Based Models). The idea is to fit the score of the distribution, rather than the likelihood, thus avoiding the…

Machine Learning · Computer Science 2024-01-31 Yilong Qin , Andrej Risteski

Score matching is a popular method for estimating unnormalized statistical models. However, it has been so far limited to simple, shallow models or low-dimensional data, due to the difficulty of computing the Hessian of log-density…

Machine Learning · Computer Science 2019-06-28 Yang Song , Sahaj Garg , Jiaxin Shi , Stefano Ermon

Microbiome data are complex in nature, involving high dimensionality, compositionally, zero inflation, and taxonomic hierarchy. Compositional data reside in a simplex that does not admit the standard Euclidean geometry. Most existing…

Methodology · Statistics 2020-11-12 Gen Li , Yan Li , Kun Chen

Score matching is an estimation procedure that has been developed for statistical models whose probability density function is known up to proportionality but whose normalizing constant is intractable, so that maximum likelihood is…

Methodology · Statistics 2024-04-23 Jiazhen Xu , Janice L. Scealy , Andrew T. A. Wood , Tao Zou

Proposed in Hyv\"arinen (2005), score matching is a parameter estimation procedure that does not require computation of distributional normalizing constants. In this work we utilize the geometric median of means to develop a robust score…

Machine Learning · Statistics 2025-06-23 Richard Schwank , Andrew McCormack , Mathias Drton

High-dimensional compositional data are prevalent in many applications. The simplex constraint poses intrinsic challenges to inferring the conditional dependence relationships among the components forming a composition, as encoded by a…

Methodology · Statistics 2024-03-25 Shucong Zhang , Huiyuan Wang , Wei Lin

We introduce a novel resampling criterion using lift scores, for improving compositional generation in diffusion models. By leveraging the lift scores, we evaluate whether generated samples align with each single condition and then compose…

Machine Learning · Computer Science 2025-05-27 Chenning Yu , Sicun Gao

We propose closed-form conditional diffusion models for data assimilation. Diffusion models use data to learn the score function (defined as the gradient of the log-probability density of a data distribution), allowing them to generate new…

Machine Learning · Statistics 2026-04-02 Brianna Binder , Agnimitra Dasgupta , Assad Oberai
‹ Prev 1 2 3 10 Next ›