Machine Learning - Scifaro

Berezinskii--Kosterlitz--Thouless transition in a context-sensitive random language model

Several power-law critical properties involving different statistics in natural languages -- reminiscent of scaling properties of physical systems at or near phase transitions -- have been documented for decades. The recent rise of large…

Machine Learning · Statistics 2026-01-23 Yuma Toji, Jun Takahashi, Vwani Roychowdhury, Hideyuki Miyahara

Many Experiments, Few Repetitions, Unpaired Data, and Sparse Effects: Is Causal Inference Possible?

We study the problem of estimating causal effects under hidden confounding in the following unpaired data setting: we observe some covariates $X$ and an outcome $Y$ under different experimental conditions (environments) but do not observe…

Machine Learning · Statistics 2026-01-22 Felix Schur, Niklas Pfister, Peng Ding, Sach Mukherjee, Jonas Peters

Multi-context principal component analysis

Principal component analysis (PCA) is a tool to capture factors that explain variation in data. Across domains, data are now collected across multiple contexts (for example, individuals with different diseases, cells of different types, or…

Machine Learning · Statistics 2026-01-22 Kexin Wang, Salil Bhate, João M. Pereira, Joe Kileel, Matylda Figlerowicz, Anna Seigal

Semi-Supervised Mixture Models under the Concept of Missing at Random with Margin Confidence and Aranda Ordaz Function

This paper presents a semi-supervised learning framework for Gaussian mixture modelling under a Missing at Random (MAR) mechanism. The method explicitly parameterizes the missingness mechanism by modelling the probability of missingness as…

Machine Learning · Statistics 2026-01-22 Jinyang Liao, Ziyang Lyu

Communication-Efficient Federated Risk Difference Estimation for Time-to-Event Clinical Outcomes

Privacy-preserving model co-training in medical research is often hindered by server-dependent architectures incompatible with protected hospital data systems and by the predominant focus on relative effect measures (hazard ratios) which…

Machine Learning · Statistics 2026-01-22 Ziwen Wang, Siqi Li, Marcus Eng Hock Ong, Nan Liu

Large Data Limits of Laplace Learning for Gaussian Measure Data in Infinite Dimensions

Laplace learning is a semi-supervised method, a solution for finding missing labels from a partially labeled dataset utilizing the geometry given by the unlabeled data points. The method minimizes a Dirichlet energy defined on a (discrete)…

Machine Learning · Statistics 2026-01-22 Zhengang Zhong, Yury Korolev, Matthew Thorpe

Meta Flow Maps enable scalable reward alignment

Controlling generative models is computationally expensive. This is because optimal alignment with a reward function--whether via inference-time steering or fine-tuning--requires estimating the value function. This task demands access to…

Machine Learning · Statistics 2026-01-22 Peter Potaptchik, Adhi Saravanan, Abbas Mammadov, Alvaro Prat, Michael S. Albergo, Yee Whye Teh

Whitening Spherical Gaussian Mixtures in the Large-Dimensional Regime

Whitening is a classical technique in unsupervised learning that can facilitate estimation tasks by standardizing data. An important application is the estimation of latent variable models via the decomposition of tensors built from…

Machine Learning · Statistics 2026-01-22 Mohammed Racim Moussa Boudjemaa, Alper Kalle, Xiaoyi Mai, José Henrique de Morais Goulart, Cédric Févotte

Dynamic angular synchronization under smoothness constraints

Given an undirected measurement graph $\mathcal{H} = ([n], \mathcal{E})$, the classical angular synchronization problem consists of recovering unknown angles $\theta_1^*,\dots,\theta_n^*$ from a collection of noisy pairwise measurements of…

Machine Learning · Statistics 2026-01-22 Ernesto Araya, Mihai Cucuringu, Hemant Tyagi

Online Statistical Inference for Contextual Bandits via Stochastic Gradient Descent

With the fast development of big data, learning the optimal decision rule by recursively updating it and making online decisions has become easier than before. We study the online statistical inference of model parameters in a contextual…

Machine Learning · Statistics 2026-01-22 Xiangyu Chang, Xi Chen, Zehua Lai, He Li, Zhihong Liu, Yichen Zhang

Intermittent time series forecasting: local vs global models

Intermittent time series, characterised by the presence of a significant amount of zeros, constitute a large percentage of inventory items in supply chain. Probabilistic forecasts are needed to plan the inventory levels; the predictive…

Machine Learning · Statistics 2026-01-21 Stefano Damato, Nicolò Rubattu, Dario Azzimonti, Giorgio Corani

Sample Complexity of Average-Reward Q-Learning: From Single-agent to Federated Reinforcement Learning

Average-reward reinforcement learning offers a principled framework for long-term decision-making by maximizing the mean reward per time step. Although Q-learning is a widely used model-free algorithm with established sample complexity in…

Machine Learning · Statistics 2026-01-21 Yuchen Jiao, Jiin Woo, Gen Li, Gauri Joshi, Yuejie Chi

Distribution-Free Confidence Ellipsoids for Ridge Regression with PAC Bounds

Linearly parametrized models are widely used in control and signal processing, with the least-squares (LS) estimate being the archetypical solution. When the input is insufficiently exciting, the LS problem may be unsolvable or numerically…

Machine Learning · Statistics 2026-01-21 Szabolcs Szentpéteri, Balázs Csanád Csáji

Empirical Risk Minimization with $f$-Divergence Regularization

In this paper, the solution to the empirical risk minimization problem with $f$-divergence regularization (ERM-$f$DR) is presented and conditions under which the solution also serves as the solution to the minimization of the expected…

Machine Learning · Statistics 2026-01-21 Francisco Daunas, Iñaki Esnaola, Samir M. Perlaza, H. Vincent Poor

A Theory of Diversity for Random Matrices with Applications to In-Context Learning of Schrödinger Equations

We address the following question: given a collection $\{\mathbf{A}^{(1)}, \dots, \mathbf{A}^{(N)}\}$ of independent $d \times d$ random matrices drawn from a common distribution $\mathbb{P}$, what is the probability that the centralizer of…

Machine Learning · Statistics 2026-01-21 Frank Cole, Yulong Lu, Shaurya Sehgal

A Kernel Approach for Semi-implicit Variational Inference

Semi-implicit variational inference (SIVI) enhances the expressiveness of variational families through hierarchical semi-implicit distributions, but the intractability of their densities makes standard ELBO-based optimization biased. Recent…

Machine Learning · Statistics 2026-01-21 Longlin Yu, Ziheng Cheng, Shiyue Zhang, Cheng Zhang

Gradient-based Active Learning with Gaussian Processes for Global Sensitivity Analysis

Global sensitivity analysis of complex numerical simulators is often limited by the small number of model evaluations that can be afforded. In such settings, surrogate models built from a limited set of simulations can substantially reduce…

Machine Learning · Statistics 2026-01-21 Guerlain Lambert, Céline Helbert, Claire Lauvernet

Memorize Early, Then Query: Inlier-Memorization-Guided Active Outlier Detection

Outlier detection (OD) aims to identify abnormal instances, known as outliers or anomalies, by learning typical patterns of normal data, or inliers. Performing OD under an unsupervised regime, without any information about anomalous…

Machine Learning · Statistics 2026-01-21 Minseo Kang, Seunghwan Park, Dongha Kim

Heuristics for Combinatorial Optimization via Value-based Reinforcement Learning: A Unified Framework and Analysis

Since the 1990s, considerable empirical work has been carried out to train statistical models, such as neural networks (NNs), as learned heuristics for combinatorial optimization (CO) problems. When successful, such an approach eliminates…

Machine Learning · Statistics 2026-01-21 Orit Davidovich, Shimrit Shtern, Segev Wasserkrug, Nimrod Megiddo

PAC Learnability in the Presence of Performativity

Following the widespread adoption of machine learning models in real-world applications, the phenomenon of performativity, i.e. model-dependent shifts in the test distribution, becomes increasingly prevalent. Unfortunately, since models…

Machine Learning · Statistics 2026-01-21 Ivan Kirev, Lyuben Baltadzhiev, Nikola Konstantinov