English
Related papers

Related papers: A Kernelised Stein Statistic for Assessing Implici…

200 papers

Complex data are often represented as a graph, which in turn can often be viewed as a realisation of a random graph, such as an inhomogeneous random graph model (IRG). For general fast goodness-of-fit tests in high dimensions, kernelised…

Machine Learning · Statistics 2026-04-02 Anum Fatima , Gesine Reinert

Non-parametric goodness-of-fit testing procedures based on kernel Stein discrepancies (KSD) are promising approaches to validate general unnormalised distributions in various scenarios. Existing works focused on studying kernel choices to…

Methodology · Statistics 2022-06-02 Wenkai Xu

Score-based kernelised Stein discrepancy (KSD) tests have emerged as a powerful tool for the goodness of fit tests, especially in high dimensions; however, the test performance may depend on the choice of kernels in an underlying…

Machine Learning · Statistics 2022-10-13 Moritz Weckbecker , Wenkai Xu , Gesine Reinert

We propose a kernel-based nonparametric test of relative goodness of fit, where the goal is to compare two models, both of which may have unobserved latent variables, such that the marginal distribution of the observed variables is…

Machine Learning · Statistics 2023-05-10 Heishiro Kanagawa , Wittawat Jitkrittum , Lester Mackey , Kenji Fukumizu , Arthur Gretton

We propose a goodness-of-fit measure for probability densities modeling observations with varying dimensionality, such as text documents of differing lengths or variable-length sequences. The proposed measure is an instance of the kernel…

Machine Learning · Statistics 2023-07-14 Jerome Baum , Heishiro Kanagawa , Arthur Gretton

We propose and analyse a novel statistical procedure, coined AgraSSt, to assess the quality of graph generators that may not be available in explicit form. In particular, AgraSSt can be used to determine whether a learnt graph generating…

Machine Learning · Statistics 2023-08-02 Wenkai Xu , Gesine Reinert

Accurately evaluating model performance is crucial for deploying machine learning systems in real-world applications. Traditional methods often require a sufficiently large labeled test set to ensure a reliable evaluation. However, in many…

Machine Learning · Computer Science 2025-11-04 Hai Hoang Thanh , Duy-Tung Nguyen , Hung The Tran , Khoat Than

Synthetic tabular data generation has emerged as a promising method to address limited data availability and privacy concerns. With the sharp increase in the performance of large language models in recent years, researchers have been…

Machine Learning · Computer Science 2025-03-28 Reilly Cannon , Nicolette M. Laird , Caesar Vazquez , Andy Lin , Amy Wagler , Tony Chiang

Kernelized Stein discrepancy (KSD) is a score-based discrepancy widely used in goodness-of-fit tests. It can be applied even when the target distribution has an unknown normalising factor, such as in Bayesian analysis. We show theoretically…

Machine Learning · Statistics 2023-06-06 Xing Liu , Andrew B. Duncan , Axel Gandy

There are two types of deep generative models: explicit and implicit. The former defines an explicit density form that allows likelihood inference; while the latter targets a flexible transformation from random noise to generated samples.…

Machine Learning · Computer Science 2021-10-27 Qitian Wu , Rui Gao , Hongyuan Zha

The machine learning community has mainly relied on real data to benchmark algorithms as it provides compelling evidence of model applicability. Evaluation on synthetic datasets can be a powerful tool to provide a better understanding of a…

Machine Learning · Computer Science 2022-11-01 Florence Regol , Anja Kroon , Mark Coates

Much of machine learning relies on comparing distributions with discrepancy measures. Stein's method creates discrepancy measures between two distributions that require only the unnormalized density of one and samples from the other. Stein…

Machine Learning · Statistics 2020-07-21 Raghav Singhal , Xintian Han , Saad Lahlou , Rajesh Ranganath

Existing example-based prediction explanation methods often bridge test and training data points through the model's parameters or latent representations. While these methods offer clues to the causes of model predictions, they often…

Machine Learning · Computer Science 2025-05-20 Mahtab Sarvmaili , Hassan Sajjad , Ga Wu

The usage of medical image data for the training of large-scale machine learning approaches is particularly challenging due to its scarce availability and the costly generation of data annotations, typically requiring the engagement of…

Computer Vision and Pattern Recognition · Computer Science 2024-06-26 Joshua Niemeijer , Jan Ehrhardt , Hristina Uzunova , Heinz Handels

Kernelized Stein discrepancy (KSD), though being extensively used in goodness-of-fit tests and model learning, suffers from the curse-of-dimensionality. We address this issue by proposing the sliced Stein discrepancy and its scalable and…

Machine Learning · Computer Science 2021-03-18 Wenbo Gong , Yingzhen Li , José Miguel Hernández-Lobato

We introduce the Kernel Calibration Conditional Stein Discrepancy test (KCCSD test), a non-parametric, kernel-based test for assessing the calibration of probabilistic models with well-defined scores. In contrast to previous methods, our…

Machine Learning · Statistics 2025-10-17 Pierre Glaser , David Widmann , Fredrik Lindsten , Arthur Gretton

We investigate properties of goodness-of-fit tests based on the Kernel Stein Discrepancy (KSD). We introduce a strategy to construct a test, called KSDAgg, which aggregates multiple tests with different kernels. KSDAgg avoids splitting the…

Machine Learning · Statistics 2023-12-22 Antonin Schrab , Benjamin Guedj , Arthur Gretton

Creation of a synthetic dataset that faithfully represents the data distribution and simultaneously preserves privacy is a major research challenge. Many space partitioning based approaches have emerged in recent years for answering…

Cryptography and Security · Computer Science 2023-06-26 Eleonora Kreačić , Navid Nouri , Vamsi K. Potluru , Tucker Balch , Manuela Veloso

Kernel methods underpin many of the most successful approaches in data science and statistics, and they allow representing probability measures as elements of a reproducing kernel Hilbert space without loss of information. Recently, the…

Machine Learning · Statistics 2025-03-19 Florian Kalinke , Zoltan Szabo , Bharath K. Sriperumbudur

We present a sequential version of the kernelized Stein discrepancy goodness-of-fit test, which allows for conducting goodness-of-fit tests for unnormalized densities that are continuously monitored and adaptively stopped. That is, the…

Machine Learning · Statistics 2025-04-18 Diego Martinez-Taboada , Aaditya Ramdas
‹ Prev 1 2 3 10 Next ›