机器学习 — Scifaro

Asymptotics of SGD in Sequence-Single Index Models and Single-Layer Attention Networks

We study the dynamics of stochastic gradient descent (SGD) for a class of sequence models termed Sequence Single-Index (SSI) models, where the target depends on a single direction in input space applied to a sequence of tokens. This setting…

机器学习 · 统计学 2025-11-13 Luca Arnaboldi , Bruno Loureiro , Ludovic Stephan , Florent Krzakala , Lenka Zdeborova

Continuous Symmetry Discovery and Enforcement Using Infinitesimal Generators of Multi-parameter Group Actions

Symmetry-informed machine learning can exhibit advantages over machine learning which fails to account for symmetry. In the context of continuous symmetry detection, current state of the art experiments are largely limited to detecting…

机器学习 · 统计学 2025-11-13 Ben Shaw , Sasidhar Kunapuli , Abram Magner , Kevin R. Moon

Fundamental Limits of Matrix Sensing: Exact Asymptotics, Universality, and Applications

In the matrix sensing problem, one wishes to reconstruct a matrix from (possibly noisy) observations of its linear projections along given directions. We consider this model in the high-dimensional limit: while previous works on this model…

机器学习 · 统计学 2025-11-13 Yizhou Xu , Antoine Maillard , Lenka Zdeborová , Florent Krzakala

Federated Variational Inference for Bayesian Mixture Models

We present a federated learning approach for Bayesian model-based clustering of large-scale binary and categorical datasets. We introduce a principled 'divide and conquer' inference procedure using variational inference with local merge and…

机器学习 · 统计学 2025-11-13 Jackie Rao , Francesca L. Crowe , Tom Marshall , Sylvia Richardson , Paul D. W. Kirk

Testing and Improving the Robustness of Amortized Bayesian Inference for Cognitive Models

Contaminant observations and outliers often cause problems when estimating the parameters of cognitive models, which are statistical models representing cognitive processes. In this study, we test and improve the robustness of parameter…

机器学习 · 统计学 2025-11-13 Yufei Wu , Stefan T. Radev , Francis Tuerlinckx

Building Conformal Prediction Intervals with Approximate Message Passing

Conformal prediction has emerged as a powerful tool for building prediction intervals that are valid in a distribution-free way. However, its evaluation may be computationally costly, especially in the high-dimensional setting where the…

机器学习 · 统计学 2025-11-13 Lucas Clarté , Lenka Zdeborová

Optimal and computationally tractable lower bounds for logistic log-likelihoods

The logit transform is arguably the most widely-employed link function beyond linear settings. This transformation routinely appears in regression models for binary data and provides a central building-block in popular methods for both…

机器学习 · 统计学 2025-11-13 Niccolò Anceschi , Cristian Castiglione , Tommaso Rigon , Giacomo Zanella , Daniele Durante

Dataset-Free Weight-Initialization on Restricted Boltzmann Machine

In feed-forward neural networks, dataset-free weight-initialization methods such as LeCun, Xavier (or Glorot), and He initializations have been developed. These methods randomly determine the initial values of weight parameters based on…

机器学习 · 统计学 2025-11-13 Muneki Yasuda , Ryosuke Maeno , Chako Takahashi

Online Nonparametric Supervised Learning for Massive Data

Despite their benefits in terms of simplicity, low computational cost and data requirement, parametric machine learning algorithms, such as linear discriminant analysis, quadratic discriminant analysis or logistic regression, suffer from…

机器学习 · 统计学 2025-11-13 Mohamed Chaouch , Omama M. Al-Hamed

Estimation and Inference in Distributional Reinforcement Learning

In this paper, we study distributional reinforcement learning from the perspective of statistical efficiency. We investigate distributional policy evaluation, aiming to estimate the complete return distribution (denoted $\eta^\pi$) attained…

机器学习 · 统计学 2025-11-13 Liangyu Zhang , Yang Peng , Jiadong Liang , Wenhao Yang , Zhihua Zhang

Concentration bounds on response-based vector embeddings of black-box generative models

Generative models, such as large language models or text-to-image diffusion models, can generate relevant responses to user-given queries. Response-based vector embeddings of generative models facilitate statistical analysis and inference…

机器学习 · 统计学 2025-11-12 Aranyak Acharyya , Joshua Agterberg , Youngser Park , Carey E. Priebe

PrAda-GAN: A Private Adaptive Generative Adversarial Network with Bayes Network Structure

We revisit the problem of generating synthetic data under differential privacy. To address the core limitations of marginal-based methods, we propose the Private Adaptive Generative Adversarial Network with Bayes Network Structure…

机器学习 · 统计学 2025-11-12 Ke Jia , Yuheng Ma , Yang Li , Feifei Wang

Distributionally Robust Online Markov Game with Linear Function Approximation

The sim-to-real gap, where agents trained in a simulator face significant performance degradation during testing, is a fundamental challenge in reinforcement learning. Extansive works adopt the framework of distributionally robust RL, to…

机器学习 · 统计学 2025-11-12 Zewu Zheng , Yuanyuan Lin

Robust Experimental Design via Generalised Bayesian Inference

Bayesian optimal experimental design is a principled framework for conducting experiments that leverages Bayesian inference to quantify how much information one can expect to gain from selecting a certain design. However, accurate Bayesian…

机器学习 · 统计学 2025-11-12 Yasir Zubayr Barlas , Sabina J. Sloman , Samuel Kaski

Infinite-Dimensional Operator/Block Kaczmarz Algorithms: Regret Bounds and $\lambda$-Effectiveness

We present a variety of projection-based linear regression algorithms with a focus on modern machine-learning models and their algorithmic performance. We study the role of the relaxation parameter in generalized Kaczmarz algorithms and…

机器学习 · 统计学 2025-11-12 Halyun Jeong , Palle E. T. Jorgensen , Hyun-Kyoung Kwon , Myung-Sin Song

Tractable Instances of Bilinear Maximization: Implementing LinUCB on Ellipsoids

We consider the maximization of $x^\top \theta$ over $(x,\theta) \in \mathcal{X} \times \Theta$, with $\mathcal{X} \subset \mathbb{R}^d$ convex and $\Theta \subset \mathbb{R}^d$ an ellipsoid. This problem is fundamental in linear bandits,…

机器学习 · 统计学 2025-11-12 Raymond Zhang , Hédi Hadiji , Richard Combes

A Malliavin calculus approach to score functions in diffusion generative models

Score-based diffusion generative models have recently emerged as a powerful tool for modelling complex data distributions. These models aim at learning the score function, which defines a map from a known probability distribution to the…

机器学习 · 统计学 2025-11-12 Ehsan Mirafzali , Frank Proske , Utkarsh Gupta , Daniele Venturi , Razvan Marinescu

Wasserstein Distributionally Robust Nonparametric Regression

Wasserstein distributionally robust optimization (WDRO) strengthens statistical learning under model uncertainty by minimizing the local worst-case risk within a prescribed ambiguity set. Although WDRO has been extensively studied in…

机器学习 · 统计学 2025-11-12 Changyu Liu , Yuling Jiao , Junhui Wang , Jian Huang

On the Convergence and Stability of Upside-Down Reinforcement Learning, Goal-Conditioned Supervised Learning, and Online Decision Transformers

This article provides a rigorous analysis of convergence and stability of Episodic Upside-Down Reinforcement Learning, Goal-Conditioned Supervised Learning and Online Decision Transformers. These algorithms performed competitively across…

机器学习 · 统计学 2025-11-12 Miroslav Štrupl , Oleg Szehr , Francesco Faccio , Dylan R. Ashley , Rupesh Kumar Srivastava , Jürgen Schmidhuber

Avoiding subtraction and division of stochastic signals using normalizing flows: NFdeconvolve

Across the scientific realm, we find ourselves subtracting or dividing stochastic signals. For instance, consider a stochastic realization, $x$, generated from the addition or multiplication of two stochastic signals $a$ and $b$, namely…

机器学习 · 统计学 2025-11-12 Pedro Pessoa , Max Schweiger , Lance W. Q. Xu , Tristan Manha , Ayush Saurabh , Julian Antolin Camarena , Steve Pressé