Machine Learning - Scifaro

CGRL: Causal-Guided Representation Learning for Graph Out-of-Distribution Generalization

Graph Neural Networks (GNNs) have achieved impressive performance in graph-related tasks. However, they suffer from poor generalization on out-of-distribution (OOD) data, as they tend to learn spurious correlations. Such correlations…

Machine Learning · Statistics 2026-03-26 Bowen Lu, Liangqiang Yang, Teng Li

Beyond Consistency: Inference for the Relative risk functional in Deep Nonparametric Cox Models

There remain theoretical gaps in deep neural network estimators for the nonparametric Cox proportional hazards model. In particular, it is unclear how gradient-based optimization error propagates to population risk under partial likelihood,…

Machine Learning · Statistics 2026-03-26 Sattwik Ghosal, Xuran Meng, Yi Li

Wasserstein Parallel Transport for Predicting the Dynamics of Statistical Systems

Many scientific systems, such as cellular populations or economic cohorts, are naturally described by probability distributions that evolve over time. Predicting how such a system would have evolved under different forces or initial…

Machine Learning · Statistics 2026-03-26 Tristan Luca Saidi, Gonzalo Mena, Larry Wasserman, Florian Gunsilius

The Mass Agreement Score: A Point-centric Measure of Cluster Size Consistency

In clustering, strong dominance in the size of a particular cluster is often undesirable, motivating a measure of cluster size uniformity that can be used to filter such partitions. A basic requirement of such a measure is stability:…

Machine Learning · Statistics 2026-03-26 Randolph Wiredu-Aidoo

Bayes with No Shame: Admissibility Geometries of Predictive Inference

Four distinct admissibility geometries govern sequential and distribution-free inference: Blackwell risk dominance over convex risk sets, anytime-valid admissibility within the nonnegative supermartingale cone, marginal coverage validity…

Machine Learning · Statistics 2026-03-26 Nicholas G. Polson, Daniel Zantedeschi

Deep Learning as a Convex Paradigm of Computation: Minimizing Circuit Size with ResNets

This paper argues that DNNs implement a computational Occam's razor -- finding the 'simplest' algorithm that fits the data -- and that this could explain their incredible and wide-ranging success over more traditional statistical methods.…

Machine Learning · Statistics 2026-03-26 Arthur Jacot

Accelerated Parallel Tempering via Neural Transports

Markov Chain Monte Carlo (MCMC) algorithms are essential tools in computational statistics for sampling from unnormalised probability distributions, but can be fragile when targeting high-dimensional, multimodal, or complex target…

Machine Learning · Statistics 2026-03-26 Leo Zhang, Peter Potaptchik, Jiajun He, Yuanqi Du, Arnaud Doucet, Francisco Vargas, Hai-Dang Dau, Saifuddin Syed

Algorithms with Calibrated Machine Learning Predictions

The field of algorithms with predictions incorporates machine learning advice in the design of online algorithms to improve real-world performance. A central consideration is the extent to which predictions can be trusted -- while existing…

Machine Learning · Statistics 2026-03-26 Judy Hanwen Shen, Ellen Vitercik, Anders Wikum

Hamiltonian Mechanics of Feature Learning: Bottleneck Structure in Leaky ResNets

We study Leaky ResNets, which interpolate between ResNets and Fully-Connected nets depending on an 'effective depth' hyper-parameter $\tilde{L}$. In the infinite depth limit, we study 'representation geodesics' $A_{p}$: continuous paths in…

Machine Learning · Statistics 2026-03-26 Arthur Jacot, Alexandre Kaiser

Contextual Graph Matching with Correlated Gaussian Features

We investigate contextual graph matching in the Gaussian setting, where both edge weights and node features are correlated across two networks. We derive precise information-theoretic thresholds for exact recovery, and identify conditions…

Machine Learning · Statistics 2026-03-25 Mohammad Hassan Ahmad Yarandi, Luca Ganassali

Between Resolution Collapse and Variance Inflation: Weighted Conformal Anomaly Detection in Low-Data Regimes

Standard conformal anomaly detection provides marginal finite-sample guarantees under the assumption of exchangeability. However, real-world data often exhibit distribution shifts, necessitating a weighted conformal approach to adapt to…

Machine Learning · Statistics 2026-03-25 Oliver Hennhöfer, Christine Preisach

High-Resolution Tensor-Network Fourier Methods for Exponentially Compressed Non-Gaussian Aggregate Distributions

Characteristic functions of weighted sums of independent random variables exhibit low-rank structure in the quantized tensor train (QTT) representation, also known as matrix product states (MPS), enabling up to exponential compression of…

Machine Learning · Statistics 2026-03-25 Juan José Rodríguez-Aldavero, Juan José García-Ripoll

Stepwise Variational Inference with Vine Copulas

We propose stepwise variational inference (VI) with vine copulas: a universal VI procedure that combines vine copulas with a novel stepwise estimation procedure of the variational parameters. Vine copulas consist of a nested sequence of…

Machine Learning · Statistics 2026-03-25 Elisabeth Griesbauer, Leiv Rønneberg, Arnoldo Frigessi, Claudia Czado, Ingrid Hobæk Haff

REALITrees: Rashomon Ensemble Active Learning for Interpretable Trees

Active learning reduces labeling costs by selecting samples that maximize information gain. A dominant framework, Query-by-Committee (QBC), typically relies on perturbation-based diversity by inducing model disagreement through random…

Machine Learning · Statistics 2026-03-25 Simon D. Nguyen, Hayden McTavish, Kentaro Hoffman, Cynthia Rudin, Tyler H. McCormick

Overfitting and Generalizing with (PAC) Bayesian Prediction in Noisy Binary Classification

We consider a PAC-Bayes type learning rule for binary classification, balancing the training error of a randomized "posterior" predictor with its KL divergence to a pre-specified "prior". This can be seen as an extension of a modified…

Machine Learning · Statistics 2026-03-25 Xiaohan Zhu, Mesrob I. Ohannessian, Nathan Srebro

Privacy-Preserving Reinforcement Learning from Human Feedback via Decoupled Reward Modeling

Preference-based fine-tuning has become an important component in training large language models, and the data used at this stage may contain sensitive user information. A central question is how to design a differentially private pipeline…

Machine Learning · Statistics 2026-03-25 Young Hyun Cho, Will Wei Sun

SPDE Methods for Nonparametric Bayesian Posterior Contraction and Laplace Approximation

We derive posterior contraction rates (PCRs) and finite-sample Bernstein-von Mises (BvM) results for non-parametric Bayesian models by extending the diffusion-based framework of Mou et al. (2024) to the infinite-dimensional setting. The…

Machine Learning · Statistics 2026-03-25 Enric Alberola-Boloix, Ioar Casado-Telletxea

Demystifying Low-Rank Knowledge Distillation in Large Language Models: Convergence, Generalization, and Information-Theoretic Guarantees

Knowledge distillation has emerged as a powerful technique for compressing large language models (LLMs) into efficient, deployable architectures while preserving their advanced capabilities. Recent advances in low-rank knowledge…

Machine Learning · Statistics 2026-03-25 Alberlucia Rafael Soarez, Daniel Kim, Mariana Costa, Alejandro Torre

Decorrelation, Diversity, and Emergent Intelligence: The Isomorphism Between Social Insect Colonies and Ensemble Machine Learning

Social insect colonies and ensemble machine learning methods represent two of the most successful examples of decentralized information processing in nature and computation respectively. Here we develop a rigorous mathematical framework…

Machine Learning · Statistics 2026-03-25 Ernest Fokoué, Gregory Babbitt, Yuval Levental

Deep Adaptive Model-Based Design of Experiments

Model-based design of experiments (MBDOE) is essential for efficient parameter estimation in nonlinear dynamical systems. However, conventional adaptive MBDOE requires costly posterior inference and design optimization between each…

Machine Learning · Statistics 2026-03-25 Arno Strouwen, Sebastian Micluţa-Câmpeanu