Machine Learning - Scifaro

On Theoretical Identifiability of Discrete Latent Causal Graphical Models

This paper considers the challenging problem of identifying a causal graphical model in the presence of latent variables. While various identifiability conditions have been proposed in the literature, they often require multiple pure…

Machine Learning · Statistics 2026-02-03 Seunghyun Lee, Yuqi Gu

The Nuclear Route: Sharp Asymptotics of ERM in Overparameterized Quadratic Networks

We study the high-dimensional asymptotics of empirical risk minimization (ERM) in over-parametrized two-layer neural networks with quadratic activations trained on synthetic data. We derive sharp asymptotics for both training and test…

Machine Learning · Statistics 2026-02-03 Vittorio Erba, Emanuele Troiani, Lenka Zdeborová, Florent Krzakala

Transportability without Graphs: A Bayesian Approach to Identifying s-Admissible Backdoor Sets

Transporting causal information across populations is a critical challenge in clinical decision-making. Causal modeling provides criteria for identifiability and transportability, but these require knowledge of the causal graph, which…

Machine Learning · Statistics 2026-02-03 Konstantina Lelova, Gregory F. Cooper, Sofia Triantafillou

DOLCE: Decomposing Off-Policy Evaluation/Learning into Lagged and Current Effects

Off-policy evaluation and learning in contextual bandits use logged interaction data to estimate and optimize the value of a target policy. Most existing methods require sufficient action overlap between the logging and target policies, and…

Machine Learning · Statistics 2026-02-03 Shu Tamano

Graph Max Shift: A Hill-Climbing Method for Graph Clustering

We present a method for graph clustering that is analogous to gradient ascent methods previously proposed for clustering points in space. The algorithm, which can be viewed as a max-degree hill-climbing procedure on the graph, iteratively…

Machine Learning · Statistics 2026-02-03 Ery Arias-Castro, Elizabeth Coda, Wanli Qiao

Joint Bayesian Parameter and Model Order Estimation for Low-Rank Probability Mass Tensors

Obtaining a reliable estimate of the joint probability mass function (PMF) of a set of random variables from observed data is a significant objective in statistical signal processing and machine learning. Modelling the joint PMF as a tensor…

Machine Learning · Statistics 2026-02-03 Joseph K. Chege, Arie Yeredor, Martin Haardt

VC Theory for Inventory Policies

There has been growing interest in applying reinforcement learning (RL) to inventory management, either by optimizing over temporal transitions or by learning directly from full historical demand trajectories. This contrasts sharply with…

Machine Learning · Statistics 2026-02-03 Yaqi Xie, Will Ma, Linwei Xin

Graph Attention Network for Node Regression on Random Geometric Graphs with Erdős–Rényi contamination

Graph attention networks (GATs) are widely used and often appear robust to noise in node covariates and edges, yet rigorous statistical guarantees demonstrating a provable advantage of GATs over non-attention graph neural networks (GNNs)…

Machine Learning · Statistics 2026-02-02 Somak Laha, Suqi Liu, Morgane Austern

A Random Matrix Theory of Masked Self-Supervised Regression

In the era of transformer models, masked self-supervised learning (SSL) has become a foundational training paradigm. A defining feature of masked SSL is that training aggregates predictions across many masking patterns, giving rise to a…

Machine Learning · Statistics 2026-02-02 Arie Wortsman Zurich, Federica Gerace, Bruno Loureiro, Yue M. Lu

Asymptotic Theory of Iterated Empirical Risk Minimization, with Applications to Active Learning

We study a class of iterated empirical risk minimization (ERM) procedures in which two successive ERMs are performed on the same dataset, and the predictions of the first estimator enter as an argument in the loss function of the second.…

Machine Learning · Statistics 2026-02-02 Hugo Cui, Yue M. Lu

OneFlowSBI: One Model, Many Queries for Simulation-Based Inference

We introduce OneFlowSBI, a unified framework for simulation-based inference that learns a single flow-matching generative model over the joint distribution of parameters and observations. Leveraging a query-aware masking…

Machine Learning · Statistics 2026-02-02 Mayank Nautiyal, Li Ju, Melker Ernfors, Klara Hagland, Ville Holma, Maximilian Werkö Söderholm, Andreas Hellander, Prashant Singh

Approximating $f$-Divergences with Rank Statistics

We introduce a rank-statistic approximation of $f$-divergences that avoids explicit density-ratio estimation by working directly with the distribution of ranks. For a resolution parameter $K$, we map the mismatch between two univariate…

Machine Learning · Statistics 2026-02-02 Viktor Stein, José Manuel de Frutos

GRANITE: A Generalized Regional Framework for Identifying Agreement in Feature-Based Explanations

Feature-based explanation methods aim to quantify how features influence the model's behavior, either locally or globally, but different methods often disagree, producing conflicting explanations. This disagreement arises primarily from two…

Machine Learning · Statistics 2026-02-02 Julia Herbinger, Gabriel Laberge, Maximilian Muschalik, Yann Pequignot, Marvin N. Wright, Fabian Fumagalli

Spectral Gradient Descent Mitigates Anisotropy-Driven Misalignment: A Case Study in Phase Retrieval

Spectral gradient methods, such as the Muon optimizer, modify gradient updates by preserving directional information while discarding scale, and have shown strong empirical performance in deep learning. We investigate the mechanisms…

Machine Learning · Statistics 2026-02-02 Guillaume Braun, Han Bao, Wei Huang, Masaaki Imaizumi

Generative and Nonparametric Approaches for Conditional Distribution Estimation: Methods, Perspectives, and Comparative Evaluations

The inference of conditional distributions is a fundamental problem in statistics, essential for prediction, uncertainty quantification, and probabilistic modeling. A wide range of methodologies have been developed for this task. This…

Machine Learning · Statistics 2026-02-02 Yen-Shiu Chin, Zhi-Yu Jou, Toshinari Morimoto, Chia-Tse Wang, Ming-Chung Chang, Tso-Jung Yen, Su-Yun Huang, Tailen Hsing

RPWithPrior: Label Differential Privacy in Regression

With the wide application of machine learning techniques in practice, privacy preservation has gained increasing attention. Protecting user privacy with minimal accuracy loss is a fundamental task in the data analysis and mining community.…

Machine Learning · Statistics 2026-02-02 Haixia Liu, Ruifan Huang

An Efficient Algorithm for Thresholding Monte Carlo Tree Search

We introduce the Thresholding Monte Carlo Tree Search problem, in which, given a tree $\mathcal{T}$ and a threshold $\theta$, a player must answer whether the root node value of $\mathcal{T}$ is at least $\theta$ or not. In the given tree,…

Machine Learning · Statistics 2026-02-02 Shoma Nameki, Atsuyoshi Nakamura, Junpei Komiyama, Koji Tabata

Simulation-based Bayesian inference with ameliorative learned summary statistics – Part I

This paper, Part 1 of a two-part series, considers simulation-based inference with learned summary statistics, in which the learned summary statistic serves as an empirical likelihood with ameliorative effects in the…

Machine Learning · Statistics 2026-02-02 Getachew K. Befekadu

Dependence-Aware Label Aggregation for LLM-as-a-Judge via Ising Models

Large-scale AI evaluation increasingly relies on aggregating binary judgments from $K$ annotators, including LLMs used as judges. Most classical methods, e.g., Dawid-Skene or (weighted) majority voting, assume annotators are conditionally…

Machine Learning · Statistics 2026-02-02 Krishnakumar Balasubramanian, Aleksandr Podkopaev, Shiva Prasad Kasiviswanathan

Optimal Transport under Group Fairness Constraints

Ensuring fairness in matching algorithms is a key challenge in allocating scarce resources and positions. Focusing on Optimal Transport (OT), we introduce a novel notion of group fairness requiring that the probability of matching two…

Machine Learning · Statistics 2026-02-02 Linus Bleistein, Mathieu Dagréou, Francisco Andrade, Thomas Boudou, Aurélien Bellet