Related papers: Score matching for compositional distributions

Score Matching for Truncated Density Estimation on a Manifold

When observations are truncated, we are limited to an incomplete picture of our dataset. Recent methods propose to use score matching for truncated density estimation, where the access to the intractable normalising constant is not…

Methodology · Statistics 2024-04-15 Daniel J. Williams, Song Liu

Interaction Models and Generalized Score Matching for Compositional Data

Applications such as the analysis of microbiome data have led to renewed interest in statistical methods for compositional data, i.e., multivariate data in the form of probability vectors that contain relative proportions. In particular,…

Methodology · Statistics 2021-09-13 Shiqing Yu, Mathias Drton, Ali Shojaie

Score matching estimators for directional distributions

One of the major problems for maximum likelihood estimation in the well-established directional models is that the normalising constants can be difficult to evaluate. A new general method of "score matching estimation" is presented here on…

Statistics Theory · Mathematics 2016-04-29 Kanti V Mardia, John T Kent, Arnab K Laha

Robust score matching for compositional data

The restricted polynomially-tilted pairwise interaction (RPPI) distribution gives a flexible model for compositional data. It is particularly well-suited to situations where some of the marginal distributions of the components of a…

Methodology · Statistics 2023-05-15 Janice L. Scealy, Kassel L. Hingee, John T. Kent, Andrew T. A. Wood

Score Matching With Missing Data

Score matching is a vital tool for learning the distribution of data with applications across many areas including diffusion processes, energy based modelling, and graphical model estimation. Despite all these applications, little work…

Machine Learning · Statistics 2025-06-03 Josh Givens, Song Liu, Henry W J Reeve

Generalized Score Matching for Regression

Many probabilistic models that have an intractable normalizing constant may be extended to contain covariates. Since the evaluation of the exact likelihood is difficult or even impossible for these models, score matching was proposed to…

Statistics Theory · Mathematics 2022-03-21 Jiazhen Xu, Janice L. Scealy, Andrew T. A. Wood, Tao Zou

A Bayesian Zero-Inflated Dirichlet-Multinomial Regression Model for Multivariate Compositional Count Data

The Dirichlet-multinomial (DM) distribution plays a fundamental role in modern statistical methodology development and application. Recently, the DM distribution and its variants have been used extensively to model multivariate count data…

Methodology · Statistics 2023-02-27 Matthew D. Koslovsky

Scalable Estimation of Dirichlet Process Mixture Models on Distributed Data

We consider the estimation of Dirichlet Process Mixture Models (DPMMs) in distributed environments, where data are distributed across multiple computing nodes. A key advantage of Bayesian nonparametric models such as DPMMs is that they…

Machine Learning · Statistics 2017-09-20 Ruohui Wang, Dahua Lin

Interpretation and Generalization of Score Matching

Score matching is a recently developed parameter learning method that is particularly effective for complicated high-dimensional density models with intractable partition functions. In this paper, we study two issues that have not been…

Machine Learning · Computer Science 2012-05-14 Siwei Lyu

Score Matching Riemannian Diffusion Means

Estimating means on Riemannian manifolds is generally computationally expensive because the Riemannian distance function is not known in closed-form for most manifolds. To overcome this, we show that Riemannian diffusion means can be…

Other Statistics · Statistics 2025-02-19 Frederik Möbius Rygaard, Steen Markvorsen, Søren Hauberg, Stefan Sommer

Score Approximation, Estimation and Distribution Recovery of Diffusion Models on Low-Dimensional Data

Diffusion models achieve state-of-the-art performance in various generation tasks. However, their theoretical foundations fall far behind. This paper studies score approximation, estimation, and distribution recovery of diffusion models,…

Machine Learning · Computer Science 2023-02-15 Minshuo Chen, Kaixuan Huang, Tuo Zhao, Mengdi Wang

High-dimensional Log-Error-in-Variable Regression with Applications to Microbial Compositional Data Analysis

In microbiome and genomic studies, the regression of compositional data has been a crucial tool for identifying microbial taxa or genes that are associated with clinical phenotypes. To account for the variation in sequencing depth, the…

Methodology · Statistics 2021-03-11 Pixu Shi, Yuchen Zhou, Anru R. Zhang

Fit Like You Sample: Sample-Efficient Generalized Score Matching from Fast Mixing Diffusions

Score matching is an approach to learning probability distributions parametrized up to a constant of proportionality (e.g. Energy-Based Models). The idea is to fit the score of the distribution, rather than the likelihood, thus avoiding the…

Machine Learning · Computer Science 2024-01-31 Yilong Qin, Andrej Risteski

Sliced Score Matching: A Scalable Approach to Density and Score Estimation

Score matching is a popular method for estimating unnormalized statistical models. However, it has been so far limited to simple, shallow models or low-dimensional data, due to the difficulty of computing the Hessian of log-density…

Machine Learning · Computer Science 2019-06-28 Yang Song, Sahaj Garg, Jiaxin Shi, Stefano Ermon

It's All Relative: New Regression Paradigm for Microbiome Compositional Data

Microbiome data are complex in nature, involving high dimensionality, compositionality, zero inflation, and taxonomic hierarchy. Compositional data reside in a simplex that does not admit the standard Euclidean geometry. Most existing…

Methodology · Statistics 2020-11-12 Gen Li, Yan Li, Kun Chen

Generalized Score Matching

Score matching is an estimation procedure that has been developed for statistical models whose probability density function is known up to proportionality but whose normalizing constant is intractable, so that maximum likelihood is…

Methodology · Statistics 2024-04-23 Jiazhen Xu, Janice L. Scealy, Andrew T. A. Wood, Tao Zou

Robust Score Matching

Proposed in Hyvärinen (2005), score matching is a parameter estimation procedure that does not require computation of distributional normalizing constants. In this work we utilize the geometric median of means to develop a robust score…

Machine Learning · Statistics 2025-06-23 Richard Schwank, Andrew McCormack, Mathias Drton

CARE: Large Precision Matrix Estimation for Compositional Data

High-dimensional compositional data are prevalent in many applications. The simplex constraint poses intrinsic challenges to inferring the conditional dependence relationships among the components forming a composition, as encoded by a…

Methodology · Statistics 2024-03-25 Shucong Zhang, Huiyuan Wang, Wei Lin

Improving Compositional Generation with Diffusion Models Using Lift Scores

We introduce a novel resampling criterion using lift scores, for improving compositional generation in diffusion models. By leveraging the lift scores, we evaluate whether generated samples align with each single condition and then compose…

Machine Learning · Computer Science 2025-05-27 Chenning Yu, Sicun Gao

Closed-form conditional diffusion models for data assimilation

We propose closed-form conditional diffusion models for data assimilation. Diffusion models use data to learn the score function (defined as the gradient of the log-probability density of a data distribution), allowing them to generate new…

Machine Learning · Statistics 2026-04-02 Brianna Binder, Agnimitra Dasgupta, Assad Oberai