机器学习 — Scifaro

BINDy -- Bayesian identification of nonlinear dynamics with reversible-jump Markov-chain Monte-Carlo

Model parsimony is an important \emph{cognitive bias} in data-driven modelling that aids interpretability and helps to prevent over-fitting. Sparse identification of nonlinear dynamics (SINDy) methods are able to learn sparse…

机器学习 · 统计学 2025-06-26 Max D. Champneys , Timothy J. Rogers

Feature learning in finite-width Bayesian deep linear networks with multiple outputs and convolutional layers

Deep linear networks have been extensively studied, as they provide simplified models of deep learning. However, little is known in the case of finite-width architectures with multiple outputs and convolutional layers. In this manuscript,…

机器学习 · 统计学 2025-06-26 Federico Bassetti , Marco Gherardi , Alessandro Ingrosso , Mauro Pastore , Pietro Rotondo

Flexible Infinite-Width Graph Convolutional Neural Networks

A common theoretical approach to understanding neural networks is to take an infinite-width limit, at which point the outputs become Gaussian process (GP) distributed. This is known as a neural network Gaussian process (NNGP). However, the…

机器学习 · 统计学 2025-06-26 Ben Anson , Edward Milsom , Laurence Aitchison

Efficient uniform approximation using Random Vector Functional Link networks

A Random Vector Functional Link (RVFL) network is a depth-2 neural network with random inner weights and biases. Only the outer weights of such an architecture are to be learned, so the learning process boils down to a linear optimization…

机器学习 · 统计学 2025-06-26 Palina Salanevich , Olov Schavemaker

Marginal Pseudo-Likelihood Learning of Markov Network structures

Undirected graphical models known as Markov networks are popular for a wide variety of applications ranging from statistical physics to computational biology. Traditionally, learning of the network structure has been done under the…

机器学习 · 统计学 2025-06-26 Johan Pensar , Henrik Nyman , Juha Niiranen , Jukka Corander

The Shape of Consumer Behavior: A Symbolic and Topological Analysis of Time Series

Understanding temporal patterns in online search behavior is crucial for real-time marketing and trend forecasting. Google Trends offers a rich proxy for public interest, yet the high dimensionality and noise of its time-series data present…

机器学习 · 统计学 2025-06-25 Pola Bereta , Ioannis Diamantis

Near-optimal estimates for the $\ell^p$-Lipschitz constants of deep random ReLU neural networks

This paper studies the $\ell^p$-Lipschitz constants of ReLU neural networks $\Phi: \mathbb{R}^d \to \mathbb{R}$ with random parameters for $p \in [1,\infty]$. The distribution of the weights follows a variant of the He initialization and…

机器学习 · 统计学 2025-06-25 Sjoerd Dirksen , Patrick Finke , Paul Geuchen , Dominik Stöger , Felix Voigtlaender

Rare dense solutions clusters in asymmetric binary perceptrons -- local entropy via fully lifted RDT

We study classical asymmetric binary perceptron (ABP) and associated \emph{local entropy} (LE) as potential source of its algorithmic hardness. Isolation of \emph{typical} ABP solutions in SAT phase seemingly suggests a universal…

机器学习 · 统计学 2025-06-25 Mihailo Stojnic

When Diffusion Models Memorize: Inductive Biases in Probability Flow of Minimum-Norm Shallow Neural Nets

While diffusion models generate high-quality images via probability flow, the theoretical understanding of this process remains incomplete. A key question is when probability flow converges to training samples or more general points on the…

机器学习 · 统计学 2025-06-25 Chen Zeno , Hila Manor , Greg Ongie , Nir Weinberger , Tomer Michaeli , Daniel Soudry

Double Machine Learning for Conditional Moment Restrictions: IV Regression, Proximal Causal Learning and Beyond

Solving conditional moment restrictions (CMRs) is a key problem considered in statistics, causal inference, and econometrics, where the aim is to solve for a function of interest that satisfies some conditional moment equalities.…

机器学习 · 统计学 2025-06-25 Daqian Shao , Ashkan Soleymani , Francesco Quinzan , Marta Kwiatkowska

Information-Theoretic Proofs for Diffusion Sampling

This paper provides an elementary, self-contained analysis of diffusion-based sampling methods for generative modeling. In contrast to existing approaches that rely on continuous-time processes and then discretize, our treatment works…

机器学习 · 统计学 2025-06-25 Galen Reeves , Henry D. Pfister

Continuous Bayesian Model Selection for Multivariate Causal Discovery

Current causal discovery approaches require restrictive model assumptions in the absence of interventional data to ensure structure identifiability. These assumptions often do not hold in real-world applications leading to a loss of…

机器学习 · 统计学 2025-06-25 Anish Dhir , Ruby Sedgwick , Avinash Kori , Ben Glocker , Mark van der Wilk

Do Vendi Scores Converge with Finite Samples? Truncated Vendi Score for Finite-Sample Convergence Guarantees

Evaluating the diversity of generative models without reference data poses methodological challenges. The reference-free Vendi and RKE scores address this by quantifying the diversity of generated data using matrix-based entropy measures.…

机器学习 · 统计学 2025-06-25 Azim Ospanov , Farzan Farnia

Constructive Universal Approximation and Finite Sample Memorization by Narrow Deep ReLU Networks

We present a fully constructive analysis of deep ReLU neural networks for classification and function approximation tasks. First, we prove that any dataset with $N$ distinct points in $\mathbb{R}^d$ and $M$ output classes can be exactly…

机器学习 · 统计学 2025-06-25 Martín Hernández , Enrique Zuazua

Continuous Generative Neural Networks: A Wavelet-Based Architecture in Function Spaces

In this work, we present and study Continuous Generative Neural Networks (CGNNs), namely, generative models in the continuous setting: the output of a CGNN belongs to an infinite-dimensional function space. The architecture is inspired by…

机器学习 · 统计学 2025-06-25 Giovanni S. Alberti , Matteo Santacesaria , Silvia Sciutto

Local Averaging Accurately Distills Manifold Structure From Noisy Data

High-dimensional data are ubiquitous, with examples ranging from natural images to scientific datasets, and often reside near low-dimensional manifolds. Leveraging this geometric structure is vital for downstream tasks, including signal…

机器学习 · 统计学 2025-06-24 Yihan Shen , Shiyu Wang , Arnaud Lamy , Mariam Avagyan , John Wright

Tight Generalization Error Bounds for Stochastic Gradient Descent in Non-convex Learning

Stochastic Gradient Descent (SGD) is fundamental for training deep neural networks, especially in non-convex settings. Understanding SGD's generalization properties is crucial for ensuring robust model performance on unseen data. In this…

机器学习 · 统计学 2025-06-24 Wenjun Xiong , Juan Ding , Xinlei Zuo , Qizhai Li

Theoretical guarantees for neural estimators in parametric statistics

Neural estimators are simulation-based estimators for the parameters of a family of statistical models, which build a direct mapping from the sample to the parameter vector. They benefit from the versatility of available network…

机器学习 · 统计学 2025-06-24 Almut Rödder , Manuel Hentschel , Sebastian Engelke

Phase retrieval with rank $d$ measurements -- \emph{descending} algorithms phase transitions

Companion paper [118] developed a powerful \emph{Random duality theory} (RDT) based analytical program to statistically characterize performance of \emph{descending} phase retrieval algorithms (dPR) (these include all variants of gradient…

机器学习 · 统计学 2025-06-24 Mihailo Stojnic

Optimal spectral initializers impact on phase retrieval phase transitions -- an RDT view

We analyze the relation between spectral initializers and theoretical limits of \emph{descending} phase retrieval algorithms (dPR). In companion paper [104], for any sample complexity ratio, $\alpha$, \emph{parametric manifold}, ${\mathcal…

机器学习 · 统计学 2025-06-24 Mihailo Stojnic