机器学习 — Scifaro

Bit-Level Discrete Diffusion with Markov Probabilistic Models: An Improved Framework with Sharp Convergence Bounds under Minimal Assumptions

This paper introduces Discrete Markov Probabilistic Models (DMPMs), a novel discrete diffusion algorithm for discrete data generation. The algorithm operates in discrete bit space, where the noising process is a continuous-time Markov chain…

机器学习 · 统计学 2025-10-09 Le-Tuyet-Nhi Pham , Dario Shariatian , Antonio Ocello , Giovanni Conforti , Alain Durmus

Ratio Divergence Learning Using Target Energy in Restricted Boltzmann Machines: Beyond Kullback--Leibler Divergence Learning

We propose ratio divergence (RD) learning for discrete energy-based models, a method that utilizes both training data and a tractable target energy function. We apply RD learning to restricted Boltzmann machines (RBMs), which are a minimal…

机器学习 · 统计学 2025-10-09 Yuichi Ishida , Yuma Ichikawa , Aki Dote , Toshiyuki Miyazawa , Koji Hukushima

An Empirical Analysis of the Laplace and Neural Tangent Kernels

The neural tangent kernel is a kernel function defined over the parameter distribution of an infinite width neural network. Despite the impracticality of this limit, the neural tangent kernel has allowed for a more direct study of neural…

机器学习 · 统计学 2025-10-09 Ronaldas Paulius Lencevičius

Implicit Updates for Average-Reward Temporal Difference Learning

Temporal difference (TD) learning is a cornerstone of reinforcement learning. In the average-reward setting, standard TD($\lambda$) is highly sensitive to the choice of step-size and thus requires careful tuning to maintain numerical…

机器学习 · 统计学 2025-10-08 Hwanwoo Kim , Dongkyu Derek Cho , Eric Laber

Bilevel optimization for learning hyperparameters: Application to solving PDEs and inverse problems with Gaussian processes

Methods for solving scientific computing and inference problems, such as kernel- and neural network-based approaches for partial differential equations (PDEs), inverse problems, and supervised learning tasks, depend crucially on the choice…

机器学习 · 统计学 2025-10-08 Nicholas H. Nelsen , Houman Owhadi , Andrew M. Stuart , Xianjin Yang , Zongren Zou

Domain-Shift-Aware Conformal Prediction for Large Language Models

Large language models have achieved impressive performance across diverse tasks. However, their tendency to produce overconfident and factually incorrect outputs, known as hallucinations, poses risks in real world applications. Conformal…

机器学习 · 统计学 2025-10-08 Zhexiao Lin , Yuanyuan Li , Neeraj Sarna , Yuanyuan Gao , Michael von Gablenz

A Probabilistic Basis for Low-Rank Matrix Learning

Low rank inference on matrices is widely conducted by optimizing a cost function augmented with a penalty proportional to the nuclear norm $\Vert \cdot \Vert_*$. However, despite the assortment of computational methods for such problems,…

机器学习 · 统计学 2025-10-08 Simon Segert , Nathan Wycoff

Minima and Critical Points of the Bethe Free Energy Are Invariant Under Deformation Retractions of Factor Graphs

In graphical models, factor graphs, and more generally energy-based models, the interactions between variables are encoded by a graph, a hypergraph, or, in the most general case, a partially ordered set (poset). Inference on such…

机器学习 · 统计学 2025-10-08 Grégoire Sergeant-Perthuis , Léo Boitel

Expected Free Energy-based Planning as Variational Inference

We address the problem of planning under uncertainty, where an agent must choose actions that not only achieve desired outcomes but also reduce uncertainty. Traditional methods often treat exploration and exploitation as separate…

机器学习 · 统计学 2025-10-08 Bert de Vries , Wouter Nuijten , Thijs van de Laar , Wouter Kouw , Sepideh Adamiat , Tim Nisslbeck , Mykola Lukashchuk , Hoang Minh Huu Nguyen , Marco Hidalgo Araya , Raphael Tresor , Thijs Jenneskens , Ivana Nikoloska , Raaja Ganapathy Subramanian , Bart van Erp , Dmitry Bagaev , Albert Podusenko

Fundamental Limits of Membership Inference Attacks on Machine Learning Models

Membership inference attacks (MIA) can reveal whether a particular data point was part of the training dataset, potentially exposing sensitive information about individuals. This article provides theoretical guarantees by exploring the…

机器学习 · 统计学 2025-10-08 Eric Aubinais , Elisabeth Gassiat , Pablo Piantanida

Model-free generalized fiducial inference

Conformal prediction (CP) was developed to provide finite-sample probabilistic prediction guarantees. While CP algorithms are a relatively general-purpose approach to uncertainty quantification, with finite-sample guarantees, they lack…

机器学习 · 统计学 2025-10-08 Jonathan P Williams

Causal Abstractions, Categorically Unified

We present a categorical framework for relating causal models that represent the same system at different levels of abstraction. We define a causal abstraction as natural transformations between appropriate Markov functors, which concisely…

机器学习 · 统计学 2025-10-07 Markus Englberger , Devendra Singh Dhami

Set to Be Fair: Demographic Parity Constraints for Set-Valued Classification

Set-valued classification is used in multiclass settings where confusion between classes can occur and lead to misleading predictions. However, its application may amplify discriminatory bias motivating the development of set-valued…

机器学习 · 统计学 2025-10-07 Eyal Cohen , Christophe Denis , Mohamed Hebiri

A Noise Resilient Approach for Robust Hurst Exponent Estimation

Understanding signal behavior across scales is vital in areas such as natural phenomena analysis and financial modeling. A key property is self-similarity, quantified by the Hurst exponent (H), which reveals long-term dependencies.…

机器学习 · 统计学 2025-10-07 Malith Premarathna , Fabrizio Ruggeri , Dixon Vimalajeewa

Kernel ridge regression under power-law data: spectrum and generalization

In this work, we investigate high-dimensional kernel ridge regression (KRR) on i.i.d. Gaussian data with anisotropic power-law covariance. This setting differs fundamentally from the classical source & capacity conditions for KRR, where…

机器学习 · 统计学 2025-10-07 Arie Wortsman , Bruno Loureiro

Fisher-Bingham-like normalizing flows on the sphere

A generic D-dimensional Gaussian can be conditioned or projected onto the D-1 unit sphere, thereby leading to the well-known Fisher-Bingham (FB) or Angular Gaussian (AG) distribution families, respectively. These are some of the most…

机器学习 · 统计学 2025-10-07 Thorsten Glüsenkamp

Divergence Phase Index: A Riesz-Transform Framework for Multidimensional Phase Difference Analysis

We introduce the Divergence Phase Index (DPI), a novel framework for quantifying phase differences in one and multidimensional signals, grounded in harmonic analysis via the Riesz transform. Based on classical Hilbert Transform phase…

机器学习 · 统计学 2025-10-07 Magaly Catanzariti , Hugo Aimar , Diego M. Mateos

Transformed $\ell_1$ Regularizations for Robust Principal Component Analysis: Toward a Fine-Grained Understanding

Robust Principal Component Analysis (RPCA) aims to recover a low-rank structure from noisy, partially observed data that is also corrupted by sparse, potentially large-magnitude outliers. Traditional RPCA models rely on convex relaxations,…

机器学习 · 统计学 2025-10-07 Kun Zhao , Haoke Zhang , Jiayi Wang , Yifei Lou

Mathematically rigorous proofs for Shapley explanations

Machine Learning is becoming increasingly more important in today's world. It is therefore very important to provide understanding of the decision-making process of machine-learning models. A popular way to do this is by looking at the…

机器学习 · 统计学 2025-10-07 David van Batenburg

Quantile-Scaled Bayesian Optimization Using Rank-Only Feedback

Bayesian Optimization (BO) is widely used for optimizing expensive black-box functions, particularly in hyperparameter tuning. However, standard BO assumes access to precise objective values, which may be unavailable, noisy, or unreliable…

机器学习 · 统计学 2025-10-07 Tunde Fahd Egunjobi