Related papers: A Kernelised Stein Statistic for Assessing Implici…

A Pure Hypothesis Test for Inhomogeneous Random Graph Models Based on a Kernelised Stein Discrepancy

Complex data are often represented as a graph, which in turn can often be viewed as a realisation of a random graph, such as an inhomogeneous random graph model (IRG). For general fast goodness-of-fit tests in high dimensions, kernelised…

Machine Learning · Statistics 2026-04-02 Anum Fatima, Gesine Reinert

Standardisation-function Kernel Stein Discrepancy: A Unifying View on Kernel Stein Discrepancy Tests for Goodness-of-fit

Non-parametric goodness-of-fit testing procedures based on kernel Stein discrepancies (KSD) are promising approaches to validate general unnormalised distributions in various scenarios. Existing works focused on studying kernel choices to…

Methodology · Statistics 2022-06-02 Wenkai Xu

On RKHS Choices for Assessing Graph Generators via Kernel Stein Statistics

Score-based kernelised Stein discrepancy (KSD) tests have emerged as a powerful tool for the goodness of fit tests, especially in high dimensions; however, the test performance may depend on the choice of kernels in an underlying…

Machine Learning · Statistics 2022-10-13 Moritz Weckbecker, Wenkai Xu, Gesine Reinert

A Kernel Stein Test for Comparing Latent Variable Models

We propose a kernel-based nonparametric test of relative goodness of fit, where the goal is to compare two models, both of which may have unobserved latent variables, such that the marginal distribution of the observed variables is…

Machine Learning · Statistics 2023-05-10 Heishiro Kanagawa, Wittawat Jitkrittum, Lester Mackey, Kenji Fukumizu, Arthur Gretton

A kernel Stein test of goodness of fit for sequential models

We propose a goodness-of-fit measure for probability densities modeling observations with varying dimensionality, such as text documents of differing lengths or variable-length sequences. The proposed measure is an instance of the kernel…

Machine Learning · Statistics 2023-07-14 Jerome Baum, Heishiro Kanagawa, Arthur Gretton

AgraSSt: Approximate Graph Stein Statistics for Interpretable Assessment of Implicit Graph Generators

We propose and analyse a novel statistical procedure, coined AgraSSt, to assess the quality of graph generators that may not be available in explicit form. In particular, AgraSSt can be used to determine whether a learnt graph generating…

Machine Learning · Statistics 2023-08-02 Wenkai Xu, Gesine Reinert

Using Synthetic Data to estimate the True Error is theoretically and practically doable

Accurately evaluating model performance is crucial for deploying machine learning systems in real-world applications. Traditional methods often require a sufficiently large labeled test set to ensure a reliable evaluation. However, in many…

Machine Learning · Computer Science 2025-11-04 Hai Hoang Thanh, Duy-Tung Nguyen, Hung The Tran, Khoat Than

Assessing Generative Models for Structured Data

Synthetic tabular data generation has emerged as a promising method to address limited data availability and privacy concerns. With the sharp increase in the performance of large language models in recent years, researchers have been…

Machine Learning · Computer Science 2025-03-28 Reilly Cannon, Nicolette M. Laird, Caesar Vazquez, Andy Lin, Amy Wagler, Tony Chiang

Using Perturbation to Improve Goodness-of-Fit Tests based on Kernelized Stein Discrepancy

Kernelized Stein discrepancy (KSD) is a score-based discrepancy widely used in goodness-of-fit tests. It can be applied even when the target distribution has an unknown normalising factor, such as in Bayesian analysis. We show theoretically…

Machine Learning · Statistics 2023-06-06 Xing Liu, Andrew B. Duncan, Axel Gandy

Bridging Explicit and Implicit Deep Generative Models via Neural Stein Estimators

There are two types of deep generative models: explicit and implicit. The former defines an explicit density form that allows likelihood inference; while the latter targets a flexible transformation from random noise to generated samples.…

Machine Learning · Computer Science 2021-10-27 Qitian Wu, Rui Gao, Hongyuan Zha

Evaluation of Categorical Generative Models -- Bridging the Gap Between Real and Synthetic Data

The machine learning community has mainly relied on real data to benchmark algorithms as it provides compelling evidence of model applicability. Evaluation on synthetic datasets can be a powerful tool to provide a better understanding of a…

Machine Learning · Computer Science 2022-11-01 Florence Regol, Anja Kroon, Mark Coates

Kernelized Complete Conditional Stein Discrepancy

Much of machine learning relies on comparing distributions with discrepancy measures. Stein's method creates discrepancy measures between two distributions that require only the unnormalized density of one and samples from the other. Stein…

Machine Learning · Statistics 2020-07-21 Raghav Singhal, Xintian Han, Saad Lahlou, Rajesh Ranganath

Data-centric Prediction Explanation via Kernelized Stein Discrepancy

Existing example-based prediction explanation methods often bridge test and training data points through the model's parameters or latent representations. While these methods offer clues to the causes of model predictions, they often…

Machine Learning · Computer Science 2025-05-20 Mahtab Sarvmaili, Hassan Sajjad, Ga Wu

TSynD: Targeted Synthetic Data Generation for Enhanced Medical Image Classification

The usage of medical image data for the training of large-scale machine learning approaches is particularly challenging due to its scarce availability and the costly generation of data annotations, typically requiring the engagement of…

Computer Vision and Pattern Recognition · Computer Science 2024-06-26 Joshua Niemeijer, Jan Ehrhardt, Hristina Uzunova, Heinz Handels

Sliced Kernelized Stein Discrepancy

Kernelized Stein discrepancy (KSD), though being extensively used in goodness-of-fit tests and model learning, suffers from the curse-of-dimensionality. We address this issue by proposing the sliced Stein discrepancy and its scalable and…

Machine Learning · Computer Science 2021-03-18 Wenbo Gong, Yingzhen Li, José Miguel Hernández-Lobato

Fast and Scalable Score-Based Kernel Calibration Tests

We introduce the Kernel Calibration Conditional Stein Discrepancy test (KCCSD test), a non-parametric, kernel-based test for assessing the calibration of probabilistic models with well-defined scores. In contrast to previous methods, our…

Machine Learning · Statistics 2025-10-17 Pierre Glaser, David Widmann, Fredrik Lindsten, Arthur Gretton

KSD Aggregated Goodness-of-fit Test

We investigate properties of goodness-of-fit tests based on the Kernel Stein Discrepancy (KSD). We introduce a strategy to construct a test, called KSDAgg, which aggregates multiple tests with different kernels. KSDAgg avoids splitting the…

Machine Learning · Statistics 2023-12-22 Antonin Schrab, Benjamin Guedj, Arthur Gretton

Differentially Private Synthetic Data Using KD-Trees

Creation of a synthetic dataset that faithfully represents the data distribution and simultaneously preserves privacy is a major research challenge. Many space partitioning based approaches have emerged in recent years for answering…

Cryptography and Security · Computer Science 2023-06-26 Eleonora Kreačić, Navid Nouri, Vamsi K. Potluru, Tucker Balch, Manuela Veloso

Nyström Kernel Stein Discrepancy

Kernel methods underpin many of the most successful approaches in data science and statistics, and they allow representing probability measures as elements of a reproducing kernel Hilbert space without loss of information. Recently, the…

Machine Learning · Statistics 2025-03-19 Florian Kalinke, Zoltan Szabo, Bharath K. Sriperumbudur

Sequential Kernelized Stein Discrepancy

We present a sequential version of the kernelized Stein discrepancy goodness-of-fit test, which allows for conducting goodness-of-fit tests for unnormalized densities that are continuously monitored and adaptively stopped. That is, the…

Machine Learning · Statistics 2025-04-18 Diego Martinez-Taboada, Aaditya Ramdas