机器学习 — Scifaro

MEC: Machine-Learning-Assisted Generalized Entropy Calibration for Semi-Supervised Mean Estimation

Obtaining high-quality labels is costly, whereas unlabeled covariates are often abundant, motivating semi-supervised inference methods with reliable uncertainty quantification. Prediction-powered inference (PPI) leverages a machine-learning…

机器学习 · 统计学 2026-05-29 Se Yoon Lee , Jae Kwang Kim

Measure flow path recovery in Bayes Hilbert spaces

We study the ill-posed problem of recovering a probability measure flow from finitely many moving localized sensors using a Bayes Hilbert framework. Relative to a fixed reference probability measure, a probability law is represented by its…

机器学习 · 统计学 2026-05-29 S. David Mis , Maarten V. de Hoop

Learning-to-Defer with Expert-Conditional Advice

Learning-to-Defer routes each input to the expert that minimizes expected cost, but it assumes that the information available to every expert is fixed at decision time. Many modern systems violate this assumption: after selecting an expert,…

机器学习 · 统计学 2026-05-29 Yannis Montreuil , Leïna Montreuil , Axel Carlier , Lai Xing Ng , Wei Tsang Ooi

Aggregate Models, Not Explanations: Improving Feature Importance Estimation

Feature-importance methods show promise in transforming machine learning models from predictive engines into tools for scientific discovery. However, due to data sampling and algorithmic stochasticity, expressive models can be unstable,…

机器学习 · 统计学 2026-05-29 Joseph Paillard , Angel Reyero Lobo , Denis A. Engemann , Bertrand Thirion

Diffusion differentiable resampling

This paper is concerned with differentiable resampling in the context of sequential Monte Carlo (e.g., particle filtering). Drawing on reparametrisation, we propose a new resampling method that is informative and instantly differentiable,…

机器学习 · 统计学 2026-05-29 Jennifer Rosina Andersson , Zheng Zhao

BITS for GAPS: Bayesian Information-Theoretic Sampling for hierarchical GAussian Process Surrogates

We introduce Bayesian Information-Theoretic Sampling for hierarchical GAussian Process Surrogates (BITS for GAPS), a framework enabling information-theoretic experimental design of Gaussian process-based surrogate models. Unlike standard…

机器学习 · 统计学 2026-05-29 Kyla D. Jones , Alexander W. Dowling

Follow-the-Perturbed-Leader for Decoupled Bandits: Best-of-Both-Worlds and Practicality

We study the decoupled multi-armed bandit problem, where the learner separately selects one arm for exploration and one, possibly different, arm for exploitation at each round. In this setting, the loss of the explored arm is observed but…

机器学习 · 统计学 2026-05-29 Chaiwon Kim , Jongyeong Lee , Min-hwan Oh

Adversarial Robustness in One-Stage Learning-to-Defer

Learning-to-Defer (L2D) enables hybrid decision-making by routing inputs either to a predictor or to external experts. While promising, L2D is highly vulnerable to adversarial perturbations, which can not only flip predictions but also…

机器学习 · 统计学 2026-05-29 Yannis Montreuil , Letian Yu , Axel Carlier , Lai Xing Ng , Wei Tsang Ooi

Calibrating Generative Models to Distributional Constraints

Generative models frequently suffer miscalibration, wherein statistics of the sampling distribution, such as the fraction of generations in a given class, deviate from desired values. We frame calibration as a constrained optimization…

机器学习 · 统计学 2026-05-29 Henry D. Smith , Nathaniel L. Diamant , Brian L. Trippe

Permutation-Invariant Spectral Learning via Dyson Diffusion

Diffusion models are central to generative modeling and have been adapted to graphs by diffusing adjacency matrix representations. The challenge of having up to $n!$ such representations for graphs with $n$ nodes is only partially mitigated…

机器学习 · 统计学 2026-05-29 Tassilo Schwarz , Cai Dieball , Constantin Kogler , Renaud Lambiotte , Arnaud Doucet , Aljaž Godec , George Deligiannidis

SADA: Safe and Adaptive Aggregation of Multiple Black-Box Predictions in Semi-Supervised Learning

Semi-supervised learning (SSL) arises in practice when labeled data are scarce or expensive to obtain, while large quantities of unlabeled data are readily available. With the growing adoption of machine learning techniques, it has become…

机器学习 · 统计学 2026-05-29 Jiawei Shan , Zhifeng Chen , Yiming Dong , Yazhen Wang , Jiwei Zhao

Risk-averse Fair Multi-class Classification

We develop a new classification framework based on the theory of coherent risk measures and systemic risk. The proposed approach is suitable for multi-class problems when the data is noisy, scarce (relative to the dimension of the problem),…

机器学习 · 统计学 2026-05-29 Darinka Dentcheva , Xiangyu Tian

From Sublinear to Linear: Local Convergence in Finite-Width Networks via Locally Polyak-Lojasiewicz Regions

We study local linear convergence of gradient descent for finite-width feedforward networks under the squared empirical loss. Prior work shows that GD can remain confined to a Locally Quasi-Convex Region (LQCR) around initialization, but…

机器学习 · 统计学 2026-05-29 Agnideep Aich , Ashit Baran Aich , Bruce Wade

Noise-Aware Differentially Private Variational Inference

Differential privacy (DP) provides robust privacy guarantees for statistical inference, but this can lead to unreliable results and biases in downstream applications. While several noise-aware approaches have been proposed which integrate…

机器学习 · 统计学 2026-05-29 Talal Alrawajfeh , Joonas Jälkö , Antti Honkela

Beyond Lipschitz: Data-Driven Robustness via Discrete Modulus of Continuity

Robustness of neural networks is commonly quantified via local or global Lipschitz constants. However, Lipschitz continuity can be overly coarse or overly restrictive as global robustness measure, failing to capture nuanced, data-dependent…

机器学习 · 统计学 2026-05-28 Jürgen Dölz , Michael Multerer , Michele Palma

Conservative neural posterior estimation via distributionally robust training

Simulation-based inference with neural posterior estimation (NPE) often yields overconfident and unreliable posteriors under limited simulation budgets. To address this, we propose DRO-NPE, a distributionally robust approach that replaces…

机器学习 · 统计学 2026-05-28 William Laplante , Yuga Hikida , Charita Dellaporta , François-Xavier Briol , Ayush Bharti

Variance-Adaptive Optimal Algorithm for Reinforcement Learning with Multinomial Logit Function Approximation

Reinforcement learning with multinomial logistic (MNL) function approximation has become an important framework due to its flexibility and broad applicability. While existing studies have established regret guarantees under worst-case…

机器学习 · 统计学 2026-05-28 Wonyoung Kim , Min-Hwan Oh , Garud Iyengar , Assaf Zeevi

Decision-focused learning for optimal PV-Battery scheduling

The use of residential photovoltaics has increased dramatically in recent years. With battery systems becoming more affordable, the optimal operation of a photovoltaic-battery system can bring significant savings to households. Optimal…

机器学习 · 统计学 2026-05-28 Joris Depoortere , Hussain Kazmi , Johan Driesen

Counterfactually Fair Regression via Optimal Transport

We consider the problem of learning a counterfactually fair regressor. We adopt a causal uncertainty view in which counterfactual fairness is defined with resampled noise. We focus on obtaining theoretical fairness guarantees for a new…

机器学习 · 统计学 2026-05-28 M. Generali Lince , S. Gaucher , J-J. Vie , P. Loiseau

Geometry of Relaxed Fair Regression: A Unified Framework for Aware and Unaware Settings

Fairness-accuracy trade-offs are a central concern in the deployment of fairness-aware machine learning methods. When sensitive attributes are unavailable at inference time-the so called unawareness setting, principled methods for obtaining…

机器学习 · 统计学 2026-05-28 M. Generali Lince , V. Divol , R. Flamary , S. Gaucher , P. Loiseau