机器学习 — Scifaro

Neural Score Matching for High-Dimensional Causal Inference

Traditional methods for matching in causal inference are impractical for high-dimensional datasets. They suffer from the curse of dimensionality: exact matching and coarsened exact matching find exponentially fewer matches as the input…

机器学习 · 统计学 2026-02-12 Oscar Clivio , Fabian Falck , Brieuc Lehmann , George Deligiannidis , Chris Holmes

Classification of high-dimensional data with spiked covariance matrix structure

We study the classification problem for high-dimensional data with $n$ observations on $p$ features where the $p \times p$ covariance matrix $\Sigma$ exhibits a spiked eigenvalue structure and the vector $\zeta$, given by the difference…

机器学习 · 统计学 2026-02-12 Yin-Jen Chen , Minh Tang

The Catastrophic Failure of The k-Means Algorithm in High Dimensions, and How Hartigan's Algorithm Avoids It

Lloyd's k-means algorithm is one of the most widely used clustering methods. We prove that in high-dimensional, high-noise settings, the algorithm exhibits catastrophic failure: with high probability, essentially every partition of the data…

机器学习 · 统计学 2026-02-11 Roy R. Lederman , David Silva-Sánchez , Ziling Chen , Gilles Mordant , Amnon Balanov , Tamir Bendory

Stabilized Maximum-Likelihood Iterative Quantum Amplitude Estimation for Structural CVaR under Correlated Random Fields

Conditional Value-at-Risk (CVaR) is a central tail-risk measure in stochastic structural mechanics, yet its accurate evaluation under high-dimensional, spatially correlated material uncertainty remains computationally prohibitive for…

机器学习 · 统计学 2026-02-11 Alireza Tabarraei

Continual Learning for non-stationary regression via Memory-Efficient Replay

Data streams are rarely static in dynamic environments like Industry 4.0. Instead, they constantly change, making traditional offline models outdated unless they can quickly adjust to the new data. This need can be adequately addressed by…

机器学习 · 统计学 2026-02-11 Pablo García-Santaclara , Bruno Fernández-Castro , RebecaP. Díaz-Redondo , Martín Alonso-Gamarra

The Entropic Signature of Class Speciation in Diffusion Models

Diffusion models do not recover semantic structure uniformly over time. Instead, samples transition from semantic ambiguity to class commitment within a narrow regime. Recent theoretical work attributes this transition to dynamical…

机器学习 · 统计学 2026-02-11 Florian Handke , Dejan Stančević , Felix Koulischer , Thomas Demeester , Luca Ambrogioni

Is Memorization Helpful or Harmful? Prior Information Sets the Threshold

We examine the connection between training error and generalization error for arbitrary estimating procedures, working in an overparameterized linear model under general priors in a Bayesian setup. We find determining factors inherent to…

机器学习 · 统计学 2026-02-11 Chen Cheng , Rina Foygel Barber

Mutual Information Collapse Explains Disentanglement Failure in $\beta$-VAEs

The $\beta$-VAE is a foundational framework for unsupervised disentanglement, using $\beta$ to regulate the trade-off between latent factorization and reconstruction fidelity. Empirically, however, disentanglement performance exhibits a…

机器学习 · 统计学 2026-02-11 Minh Vu , Xiaoliang Wan , Shuangqing Wei

Minimum Distance Summaries for Robust Neural Posterior Estimation

Simulation-based inference (SBI) enables amortized Bayesian inference by first training a neural posterior estimator (NPE) on prior-simulator pairs, typically through low-dimensional summary statistics, which can then be cheaply reused for…

机器学习 · 统计学 2026-02-11 Sherman Khoo , Dennis Prangle , Song Liu , Mark Beaumont

Persistent Entropy as a Detector of Phase Transitions

Persistent entropy (PE) is an information-theoretic summary statistic of persistence barcodes that has been widely used to detect regime changes in complex systems. Despite its empirical success, a general theoretical understanding of when…

机器学习 · 统计学 2026-02-11 Matteo Rucco

Scalable Mean-Field Variational Inference via Preconditioned Primal-Dual Optimization

In this work, we investigate the large-scale mean-field variational inference (MFVI) problem from a mini-batch primal-dual perspective. By reformulating MFVI as a constrained finite-sum problem, we develop a novel primal-dual algorithm…

机器学习 · 统计学 2026-02-11 Jinhua Lyu , Tianmin Yu , Ying Ma , Naichen Shi

Statistical Guarantees for Reasoning Probes on Looped Boolean Circuits

We study the statistical behaviour of reasoning probes in a stylized model of looped reasoning, given by Boolean circuits whose computational graph is a perfect $\nu$-ary tree ($\nu\ge 2$) and whose output is appended to the input and fed…

机器学习 · 统计学 2026-02-11 Anastasis Kratsios , Giulia Livieri , A. Martina Neuman

Double Fairness Policy Learning: Integrating Action Fairness and Outcome Fairness in Decision-making

Fairness is a central pillar of trustworthy machine learning, especially in domains where accuracy- or profit-driven optimization is insufficient. While most fairness research focuses on supervised learning, fairness in policy learning…

机器学习 · 统计学 2026-02-11 Zeyu Bian , Lan Wang , Chengchun Shi , Zhengling Qi

Adapting Noise to Data: Generative Flows from 1D Processes

The default Gaussian latent in flow-based generative models poses challenges when learning certain distributions such as heavy-tailed ones. We introduce a general framework for learning data-adaptive latent distributions using…

机器学习 · 统计学 2026-02-11 Jannis Chemseddine , Gregor Kornhardt , Richard Duong , Gabriele Steidl

A Nonparametric Discrete Hawkes Model with a Collapsed Gaussian-Process Prior

Hawkes process models are used in settings where past events increase the likelihood of future events occurring. Many applications record events as counts on a regular grid, yet discrete-time Hawkes models remain comparatively underused and…

机器学习 · 统计学 2026-02-11 Trinnhallen Brisley , Gordon Ross , Daniel Paulin

Sharp High-Probability Rates for Nonlinear SGD under Heavy-Tailed Noise via Symmetrization

We study convergence in high-probability of SGD-type methods in non-convex optimization and the presence of heavy-tailed noise. To combat the heavy-tailed noise, a general black-box nonlinear framework is considered, subsuming…

机器学习 · 统计学 2026-02-11 Aleksandar Armacki , Dragana Bajovic , Dusan Jakovetic , Soummya Kar

Input Convex Kolmogorov Arnold Networks

This article presents an input convex neural network architecture using Kolmogorov-Arnold networks (ICKAN). Two specific networks are presented: the first is based on a low-order, linear-by-part, representation of functions, and a universal…

机器学习 · 统计学 2026-02-11 Thomas Deschatre , Xavier Warin

Learning Probabilities of Causation with Mask-Augmented Data

Probabilities of causation play a central role in modern decision making. Tian and Pearl first introduced formal definitions and derived tight bounds for three binary probabilities of causation, such as the probability of necessity and…

机器学习 · 统计学 2026-02-11 Shuai Wang , Yizhou Sun , Judea Pearl , Ang Li

Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning

Reinforcement learning from human feedback (RLHF) has emerged as a key technique for aligning the output of large language models (LLMs) with human preferences. To learn the reward function, most existing RLHF algorithms use the…

机器学习 · 统计学 2026-02-11 Kai Ye , Hongyi Zhou , Jin Zhu , Francesco Quinzan , Chengchun Shi

Aggregation Models with Optimal Weights for Distributed Gaussian Processes

Gaussian process (GP) models have received increasing attention in recent years due to their superb prediction accuracy and modeling flexibility. To address the computational burdens of GP models for large-scale datasets, distributed…

机器学习 · 统计学 2026-02-11 Haoyuan Chen , Rui Tuo