机器学习 — Scifaro

A Primer on Variational Inference for Physics-Informed Deep Generative Modelling

Variational inference (VI) is a computationally efficient and scalable methodology for approximate Bayesian inference. It strikes a balance between accuracy of uncertainty quantification and practical tractability. It excels at generative…

机器学习 · 统计学 2025-04-15 Alex Glyn-Davies , Arnaud Vadeboncoeur , O. Deniz Akyildiz , Ieva Kazlauskaite , Mark Girolami

Materials Discovery using Max K-Armed Bandit

Search algorithms for the bandit problems are applicable in materials discovery. However, the objectives of the conventional bandit problem are different from those of materials discovery. The conventional bandit problem aims to maximize…

机器学习 · 统计学 2025-04-15 Nobuaki Kikkawa , Hiroshi Ohno

A Nonparametric Approach with Marginals for Modeling Consumer Choice

Given data on the choices made by consumers for different offer sets, a key challenge is to develop parsimonious models that describe and predict consumer choice behavior while being amenable to prescriptive tasks such as pricing and…

机器学习 · 统计学 2025-04-15 Yanqiu Ruan , Xiaobo Li , Karthyek Murthy , Karthik Natarajan

A Bayesian Model for Online Activity Sample Sizes

In many contexts it is useful to predict the number of individuals in some population who will initiate a particular activity during a given period. For example, the number of users who will install a software update, the number of…

机器学习 · 统计学 2025-04-15 Thomas Richardson , Yu Liu , James McQueen , Doug Hains

Transformer Learns Optimal Variable Selection in Group-Sparse Classification

Transformers have demonstrated remarkable success across various applications. However, the success of transformers have not been understood in theory. In this work, we give a case study of how transformers can be trained to learn a classic…

机器学习 · 统计学 2025-04-14 Chenyang Zhang , Xuran Meng , Yuan Cao

Gradient Descent Robustly Learns the Intrinsic Dimension of Data in Training Convolutional Neural Networks

Modern neural networks are usually highly over-parameterized. Behind the wide usage of over-parameterized networks is the belief that, if the data are simple, then the trained network will be automatically equivalent to a simple predictor.…

机器学习 · 统计学 2025-04-14 Chenyang Zhang , Peifeng Gao , Difan Zou , Yuan Cao

Deep Distributional Learning with Non-crossing Quantile Network

In this paper, we introduce a non-crossing quantile (NQ) network for conditional distribution learning. By leveraging non-negative activation functions, the NQ network ensures that the learned distributions remain monotonic, effectively…

机器学习 · 统计学 2025-04-14 Guohao Shen , Runpeng Dai , Guojun Wu , Shikai Luo , Chengchun Shi , Hongtu Zhu

Microfoundation Inference for Strategic Prediction

Often in prediction tasks, the predictive model itself can influence the distribution of the target variable, a phenomenon termed performative prediction. Generally, this influence stems from strategic actions taken by stakeholders with a…

机器学习 · 统计学 2025-04-14 Daniele Bracale , Subha Maity , Felipe Maia Polo , Seamus Somerstep , Moulinath Banerjee , Yuekai Sun

Manifolds, Random Matrices and Spectral Gaps: The geometric phases of generative diffusion

In this paper, we investigate the latent geometry of generative diffusion models under the manifold hypothesis. For this purpose, we analyze the spectrum of eigenvalues (and singular values) of the Jacobian of the score function, whose…

机器学习 · 统计学 2025-04-14 Enrico Ventura , Beatrice Achilli , Gianluigi Silvestri , Carlo Lucibello , Luca Ambrogioni

Learning the Distribution Map in Reverse Causal Performative Prediction

In numerous predictive scenarios, the predictive model affects the sampling distribution; for example, job applicants often meticulously craft their resumes to navigate through a screening systems. Such shifts in distribution are…

机器学习 · 统计学 2025-04-14 Daniele Bracale , Subha Maity , Moulinath Banerjee , Yuekai Sun

Understanding Optimal Feature Transfer via a Fine-Grained Bias-Variance Analysis

In the transfer learning paradigm models learn useful representations (or features) during a data-rich pretraining stage, and then use the pretrained representation to improve model performance on data-scarce downstream tasks. In this work,…

机器学习 · 统计学 2025-04-14 Yufan Li , Subhabrata Sen , Ben Adlam

Optimal Rates and Saturation for Noiseless Kernel Ridge Regression

Kernel ridge regression (KRR), also known as the least-squares support vector machine, is a fundamental method for learning functions from finite samples. While most existing analyses focus on the noisy setting with constant-level label…

机器学习 · 统计学 2025-04-14 Jihao Long , Xiaojun Peng , Lei Wu

Wasserstein Gradient Flows for Moreau Envelopes of f-Divergences in Reproducing Kernel Hilbert Spaces

Commonly used $f$-divergences of measures, e.g., the Kullback-Leibler divergence, are subject to limitations regarding the support of the involved measures. A remedy is regularizing the $f$-divergence by a squared maximum mean discrepancy…

机器学习 · 统计学 2025-04-14 Viktor Stein , Sebastian Neumayer , Nicolaj Rux , Gabriele Steidl

Can SGD Select Good Fishermen? Local Convergence under Self-Selection Biases and Beyond

We revisit the problem of estimating $k$ linear regressors with self-selection bias in $d$ dimensions with the maximum selection criterion, as introduced by Cherapanamjeri, Daskalakis, Ilyas, and Zampetakis [CDIZ23, STOC'23]. Our main…

机器学习 · 统计学 2025-04-11 Alkis Kalavasis , Anay Mehrotra , Felix Zhou

DCSI -- An improved measure of cluster separability based on separation and connectedness

Whether class labels in a given data set correspond to meaningful clusters is crucial for the evaluation of clustering algorithms using real-world data sets. This property can be quantified by separability measures. The central aspects of…

机器学习 · 统计学 2025-04-11 Jana Gauss , Fabian Scheipl , Moritz Herrmann

A Theory of Non-Linear Feature Learning with One Gradient Step in Two-Layer Neural Networks

Feature learning is thought to be one of the fundamental reasons for the success of deep neural networks. It is rigorously known that in two-layer fully-connected neural networks under certain conditions, one step of gradient descent on the…

机器学习 · 统计学 2025-04-11 Behrad Moniri , Donghwan Lee , Hamed Hassani , Edgar Dobriban

Deep Fair Learning: A Unified Framework for Fine-tuning Representations with Sufficient Networks

Ensuring fairness in machine learning is a critical and challenging task, as biased data representations often lead to unfair predictions. To address this, we propose Deep Fair Learning, a framework that integrates nonlinear sufficient…

机器学习 · 统计学 2025-04-10 Enze Shi , Linglong Kong , Bei Jiang

Scalable Geometric Learning with Correlation-Based Functional Brain Networks

The correlation matrix is a central representation of functional brain networks in neuroimaging. Traditional analyses often treat pairwise interactions independently in a Euclidean setting, overlooking the intrinsic geometry of correlation…

机器学习 · 统计学 2025-04-10 Kisung You , Yelim Lee , Hae-Jeong Park

Accelerated Stein Variational Gradient Flow

Stein variational gradient descent (SVGD) is a kernel-based particle method for sampling from a target distribution, e.g., in generative modeling and Bayesian inference. SVGD does not require estimating the gradient of the log-density,…

机器学习 · 统计学 2025-04-10 Viktor Stein , Wuchen Li

Off-the-grid learning of mixtures from a continuous dictionary

We consider a general non-linear model where the signal is a finite mixture of an unknown, possibly increasing, number of features issued from a continuous dictionary parameterized by a real non-linear parameter. The signal is observed with…

机器学习 · 统计学 2025-04-10 Cristina Butucea , Jean-François Delmas , Anne Dutfoy , Clément Hardy