机器学习 — Scifaro

Semiparametric Efficient Test for Interpretable Distributional Treatment Effects

Distributional treatment effects can be invisible to means: a treatment may preserve average outcomes while changing tails, modes, dispersion, or rare-event probabilities. Kernel tests can detect discrepancies between interventional outcome…

机器学习 · 统计学 2026-05-11 Houssam Zenati , Arthur Gretton

Consistency Regularised Gradient Flows for Inverse Problems

Vision-Language Latent Diffusion Models (LDMs) (Rombach et al., 2022) provide powerful generative priors for inverse problems. However, existing LDM-based inverse solvers typically require a large number of neural function evaluations…

机器学习 · 统计学 2026-05-11 Alessio Spagnoletti , Tim Y. J. Wang , Marcelo Pereyra , O. Deniz Akyildiz

Characterizing and Correcting Effective Target Shift in Online Learning

Online learning from a stream of data is a defining feature of intelligence, yet modern machine learning systems often struggle in this setting, especially under distributional shift. To understand its basic properties, we study the…

机器学习 · 统计学 2026-05-11 Ziyan Li , Naoki Hiratani

Expectation-Maximization as a Spectrally Governed Relaxation Flow

The expectation--maximization (EM) algorithm combines global monotonicity, local linear convergence, and strong practical robustness, but these features are usually analyzed separately. Global descent is nonlinear, whereas local convergence…

机器学习 · 统计学 2026-05-11 Qiao Wang

Flow Matching for Count Data

High-dimensional count data arise in applications such as single-cell RNA sequencing and neural spike trains, where mapping between distributions across successive batches or time points form critical components of data analysis. The recent…

机器学习 · 统计学 2026-05-11 Ganchao Wei , John Pearson

TopoFisher: Learning Topological Summary Statistics by Maximizing Fisher Information

Persistence diagrams provide stable, interpretable summaries of geometric and topological structure and are useful for simulation-based inference when low-order statistics miss key information. Yet persistence-based pipelines require…

机器学习 · 统计学 2026-05-11 Matteo Biagetti , Mathieu Carrière , Francesco Conti , Enrico Maria Ferrari , Sven Heydenreich , Karthik Viswanathan

Debiased Counterfactual Generation via Flow Matching from Observations

Estimating counterfactual distributions under interventions is central to treatment risk assessment and counterfactual generation tasks. Existing approaches model the counterfactual distribution as a standalone generative target, without…

机器学习 · 统计学 2026-05-11 Hugh Dance , Johnny Xi , Peter Orbanz , Benjamin Bloem-Reddy

Reliable Chain-of-Thought via Prefix Consistency

Large Language Models often improve accuracy on reasoning tasks by sampling multiple Chain-of-Thought (CoT) traces and aggregating them with majority voting (MV), a test-time technique called self-consistency. When we truncate a CoT partway…

机器学习 · 统计学 2026-05-11 Naoto Iwase , Yuki Ichihara , Mohammad Atif Quamar , Junpei Komiyama

Spectrum-Adaptive Generalization Bounds for Trained Deep Transformers

Understanding why trained Transformers generalize well is a fundamental problem in modern machine learning theory, and complexity-based generalization bounds provide a principled way to study this question. While existing norm-based bounds…

机器学习 · 统计学 2026-05-11 Mana Sakai , Masaaki Imaizumi

Classification Fields: Arbitrarily Fine Recursive Hierarchical Clustering From Few Examples

Classical clustering methods usually return either a finite partition of the observed data or a finite dendrogram over it. This finite-sample view is inadequate when the hierarchy of interest is a recursive geometric object with fine-scale…

机器学习 · 统计学 2026-05-11 Yicen Li , Ruiyang Hong , Anastasis Kratsios , Haitz Sáez de Ocáriz Borde , Paul D. McNicholas

TRACE: Transport Alignment Conformal Prediction via Diffusion and Flow Matching Models

Constructing valid and informative conformal prediction regions for multi-dimensional outputs remains a fundamental challenge. While conformal prediction provides finite-sample, distribution-free coverage guarantees, its practical…

机器学习 · 统计学 2026-05-11 Zhenhan Fang , Aixin Tan , Jian Huang

Every Feedforward Neural Network Definable in an o-Minimal Structure Has Finite Sample Complexity

We show that, in a precise sense, a broad class of feedforward neural networks learn (have finite sample complexity) in the PAC model: every fixed finite feedforward architecture whose layers are definable in an o-minimal structure has…

机器学习 · 统计学 2026-05-11 Anastasis Kratsios , Gregory Cousins , Haitz Sáez de Ocáriz Borde , Bum Jun Kim , Simone Brugiapaglia

Causal EpiNets: Precision-corrected Bounds on Individual Treatment Effects using Epistemic Neural Networks

Individual treatment effects are not point-identified from data. The Probability of Necessity and Sufficiency (PNS) circumvents this limitation by characterizing individual-level causality through intersection bounds derived from combined…

机器学习 · 统计学 2026-05-11 Gandharv Patil , Keyi Tang , Raquel Aoki , Leo Guelman

An Interpretable and Scalable Framework for Evaluating Large Language Models

Evaluation of large language models (LLMs) is increasingly critical, yet standard benchmarking methods rely on average accuracy, overlooking both the inherent stochasticity of LLM outputs and the heterogeneity of benchmark items. Item…

机器学习 · 统计学 2026-05-11 Xinhao Qu , Qiang Heng , Hao Zeng , Xiaoqian Liu

BGM-IV: an AI-powered Bayesian generative modeling approach for instrumental variable analysis

Instrumental-variable (IV) regression enables causal estimation under endogeneity, but modern IV problems often involve nonlinear structural effects and high-dimensional covariates. Existing nonlinear IV methods directly learn the causal…

机器学习 · 统计学 2026-05-11 Guyue Luo , Qiao Liu

A Differentiable Bayesian Relaxation for Latent Partial-Order Inference

Many ranking and agent trace datasets are recorded as linear orders even though their latent structure is only partially ordered. This is especially common in agent and workflow traces, where observed order may reflect arbitrary…

机器学习 · 统计学 2026-05-11 Dongqing Li , Geoff K. Nicholls , Shiyi Sun , You Luo

Locally Near Optimal Piecewise Linear Regression in High Dimensions via Difference of Max-Affine Functions

This paper presents a parametric solution to piecewise linear regression through the Adaptive Block Gradient Descent (ABGD) algorithm. The heart of the method is the parametrization of piecewise linear functions as the difference of…

机器学习 · 统计学 2026-05-11 Haitham Kanj , Kiryung Lee

Kernel Selection is Model Selection: A Unified Complexity-Penalized Approach for MMD Two-Sample Tests

The Maximum Mean Discrepancy (MMD) is a cornerstone statistic for nonparametric two-sample testing, but its test power is dictated entirely by the chosen kernel. Because any fixed kernel inherently fails to distinguish certain…

机器学习 · 统计学 2026-05-11 Yijin Ni , Xiaoming Huo

How Does Attention Help? Insights from Random Matrices on Signal Recovery from Sequence Models

We study the spectral properties of sample covariance matrices constructed from pooled sequence representations, where token embeddings are drawn from a fixed two-class Gaussian mixture table and pooled via (fixed) attention weights.…

机器学习 · 统计学 2026-05-11 Mohamed El Amine Seddik

Beyond Bellman: High-Order Generator Regression for Continuous-Time Policy Evaluation

We study finite-horizon continuous-time policy evaluation from discrete closed-loop trajectories under time-inhomogeneous dynamics. The target value surface solves a backward parabolic equation, but the Bellman baseline obtained from…

机器学习 · 统计学 2026-05-11 Yaowei Zheng , Richong Zhang , Shenxi Wu , Shirui Bian , Haosong Zhang , Li Zeng , Xingjian Ma , Yichi Zhang