Machine Learning - Scifaro

Differentially Private Sampling from Distributions via Wasserstein Projection

In this paper, we study the problem of sampling from a distribution under the constraint of differential privacy (DP). Prior works measure the utility of DP sampling with density ratio-based measures such as KL divergence. However, such…

Machine Learning · Statistics 2026-05-12 Shokichi Takakura, Seng Pei Liew, Satoshi Hasegawa

Unified Approach for Weakly Supervised Multicalibration

Multicalibration requires predicted scores to agree with label probabilities across rich families of subgroups and score-dependent tests, but existing methods require clean input-label pairs for evaluation and post-processing. This…

Machine Learning · Statistics 2026-05-12 Futoshi Futami, Takashi Ishida

Supercharging Bayesian Inference with Reliable AI-Informed Priors

Modern predictive systems encode beliefs that can act as useful prior information for statistical inference in data-limited settings. Using them for prior construction introduces a tradeoff: an informative prior built from a predictive…

Machine Learning · Statistics 2026-05-12 Jongwoo Choi, Sean O'Hagan

Learning stochastic multiscale models through normalizing flows

Many systems in physics, engineering, and biology exhibit multiscale stochastic dynamics, where low-dimensional slow variables evolve under the influence of high-dimensional fast processes. In practice, observations are often limited to a…

Machine Learning · Statistics 2026-05-12 Anan Saha, Arnab Ganguly

Metropolis-Adjusted Diffusion Models

Sampling from score-based diffusion models incurs bias due to both time discretisation and the approximation of the score function. A common strategy for reducing this bias is to apply corrector steps based on the unadjusted Langevin…

Machine Learning · Statistics 2026-05-12 Kevin H. Lam, Tyler Farghly, Christopher Williams, Jun Yang, Yee Whye Teh, Arnaud Doucet

Empirical Bayes 1-bit matrix completion

The problem of predicting unobserved entries in a binary matrix, known as 1-bit matrix completion, has found diverse applications in fields such as recommendation systems. In this study, we develop an empirical Bayes method for 1-bit matrix…

Machine Learning · Statistics 2026-05-12 Takeru Matsuda

Quantitative Local Convergence of Mean-Field Stein Variational Gradient Flow

Stein Variational Gradient Descent (SVGD) is a deterministic interacting-particle method for sampling from a target probability measure given access to its score function. In the mean-field and continuous-time limit, it is known that the…

Machine Learning · Statistics 2026-05-12 Lénaïc Chizat, Maria Colombo, Roberto Colombo, Xavier Fernández-Real

Optimal Regret for Single Index Bandits

We study the single-index bandit problem, where rewards depend on an unknown one-dimensional projection of high-dimensional contexts through an unknown reward function. This model extends linear and generalized linear bandits to…

Machine Learning · Statistics 2026-05-12 Devdan Dey, Sujoy Bhore, Avishek Ghosh

Optimality of Sub-network Laplace Approximations: New Results and Methods

Although the Laplace approximation offers a simple route to uncertainty quantification in deep neural networks, its reliance on inverting large Hessian matrices has motivated a range of computationally feasible low-dimensional or sparse…

Machine Learning · Statistics 2026-05-12 Swarnali Raha, Kshitij Khare, Rohit K Patra

Survey-aware Machine Learning: A Guideline for Valid Population Health Inference based on Scoping Review

Machine Learning (ML) models trained on complex health surveys such as the National Health and Nutrition Examination Survey (NHANES) often ignore primary sampling units, stratification variables, and sampling weights. This practice violates…

Machine Learning · Statistics 2026-05-12 YongKyung Oh, Henry W. Zheng, Jeffrey Feng, Alex A. T. Bui

Tight Generalization Bounds for Noiseless Inverse Optimization

Inverse optimization (IO) seeks to infer the parameters of a decision-maker's objective from observed context-action data. We study noiseless IO, where demonstrations are generated by a ground-truth objective. We provide a high-probability…

Machine Learning · Statistics 2026-05-12 Pouria Fatemi, Hoomaan Maskan, Suvrit Sra, Peyman Mohajerin Esfahani

Learning Theory of Transformers: Local-to-Global Approximation via Softmax Partition of Unity

This paper investigates the learning theory of Transformer networks for regression tasks on the compact Euclidean domain $[0,1]^d$ and $d$-dimensional compact Riemannian manifolds. We propose a novel constructive approximation framework for…

Machine Learning · Statistics 2026-05-12 Zhongjie Shi, Wenjing Liao

Measuring and Decomposing Mode Separation via the Canonical Diffusion

Mode separation, namely how sharply a distribution fragments into barrier-separated clusters, is a fundamental geometric property of densities, difficult to quantify in high dimensions. It is structurally distinct from dispersion, yet…

Machine Learning · Statistics 2026-05-12 Shaul Tolkovsky, Ori Meidler, Or Zuk

Core-Halo Decomposition: Decentralizing Large-Scale Fixed-Point Problems

We study solving the large-scale fixed-point equation $x^\star=\bar F(x^\star)$ by decomposition. Standard strict decomposition assigns each agent a disjoint block and evaluates updates using only the coordinates that agent owns. For most operators,…

Machine Learning · Statistics 2026-05-12 Haixiang, Yang Xu, Jiefu Zhang, Xudong Wu, Zihan Zhou, Jun He, Jiayu Chen

CONTRA: Conformal Prediction Region via Normalizing Flow Transformation

Density estimation and reliable prediction regions for outputs are crucial in supervised and unsupervised learning. While conformal prediction effectively generates coverage-guaranteed regions, it struggles with multi-dimensional outputs…

Machine Learning · Statistics 2026-05-12 Zhenhan Fang, Aixin Tan, Jian Huang

Learnability and Competition in High-Dimensional Multi-Component ICA

Independent Component Analysis (ICA) is a foundational tool for unsupervised representation learning, yet its high-dimensional theory remains largely limited to single-component recovery. We develop an asymptotically exact mean-field theory…

Machine Learning · Statistics 2026-05-12 Eser Ilke Genc, Samet Demir, Zafer Dogan

Sliced Inner Product Gromov-Wasserstein Distances

The Gromov-Wasserstein (GW) problem provides a framework for aligning heterogeneous datasets by matching their intrinsic geometry, but its statistical and computational scaling remains an issue for high-dimensional problems. Slicing…

Machine Learning · Statistics 2026-05-12 Xiaoyun Gong, Gabriel Rioux, Ziv Goldfeld

Sinkhorn Treatment Effects: A Causal Optimal Transport Measure

We introduce the Sinkhorn treatment effect, an entropic optimal transport measure of divergence between counterfactual distributions. Unlike classical quantities such as the average treatment effect, this measure captures differences across…

Machine Learning · Statistics 2026-05-12 Medha Agarwal, Alex Luedtke

Active Multiple-Prediction-Powered Inference

Post-deployment monitoring of healthcare AI requires statistically valid, label-efficient methods, but gold-standard labels from clinician chart review are expensive. Prediction-powered inference (PPI) and active statistical inference (ASI)…

Machine Learning · Statistics 2026-05-12 Nicholas Brawand, Nima Leclerc, Anhthy Ngo, Matthew Peterson, Sriram Vishwanath, Laith Alhussein, Ben Wellner

Decentralized Conformal Novelty Detection via Quantized Model Exchange

This work studies decentralized novelty detection with global false discovery rate (FDR) control across heterogeneous composite null distributions, without sharing the raw data due to privacy and bandwidth considerations. We propose a…

Machine Learning · Statistics 2026-05-12 Kyle Loh, Yu Xiang