Machine Learning - Scifaro

Binary Hypothesis Testing for Softmax Models and Leverage Score Models

Softmax distributions are widely used in machine learning, including Large Language Models (LLMs), where the attention unit uses softmax distributions. We abstract the attention unit as the softmax model, where given a vector input, the…

Machine Learning · Statistics 2025-06-02 Yuzhou Gu, Zhao Song, Junze Yin

Information Leakage Detection through Approximate Bayes-optimal Prediction

In today's data-driven world, the proliferation of publicly available information raises security concerns due to the information leakage (IL) problem. IL involves unintentionally exposing sensitive information to unauthorized parties via…

Machine Learning · Statistics 2025-06-02 Pritha Gupta, Marcel Wever, Eyke Hüllermeier

Nonparametric Additive Value Functions: Interpretable Reinforcement Learning with an Application to Surgical Recovery

We propose a nonparametric additive model for estimating interpretable value functions in reinforcement learning, with an application in optimizing postoperative recovery through personalized, adaptive recommendations. While reinforcement…

Machine Learning · Statistics 2025-06-02 Patrick Emedom-Nnamdi, Timothy R. Smith, Jukka-Pekka Onnela, Junwei Lu

Nested Nonparametric Instrumental Variable Regression

Several causal parameters in short panel data models are functionals of a nested nonparametric instrumental variable regression (nested NPIV). Recent examples include mediated, time varying, and long term treatment effects identified using…

Machine Learning · Statistics 2025-06-02 Isaac Meza, Rahul Singh

Instance-Optimality for Private KL Distribution Estimation

We study the fundamental problem of estimating an unknown discrete distribution $p$ over $d$ symbols, given $n$ i.i.d. samples from the distribution. We are interested in minimizing the KL divergence between the true distribution and the…

Machine Learning · Statistics 2025-05-30 Jiayuan Ye, Vitaly Feldman, Kunal Talwar

Multilook Coherent Imaging: Theoretical Guarantees and Algorithms

Multilook coherent imaging is a widely used technique in applications such as digital holography, ultrasound imaging, and synthetic aperture radar. A central challenge in these systems is the presence of multiplicative noise, commonly known…

Machine Learning · Statistics 2025-05-30 Xi Chen, Soham Jana, Christopher A. Metzler, Arian Maleki, Shirin Jalali

Learning Parametric Distributions from Samples and Preferences

Recent advances in language modeling have underscored the role of preference feedback in enhancing model performance. This paper investigates the conditions under which preference feedback improves parameter estimation in classes of…

Machine Learning · Statistics 2025-05-30 Marc Jourdan, Gizem Yüce, Nicolas Flammarion

JAPAN: Joint Adaptive Prediction Areas with Normalising-Flows

Conformal prediction provides a model-agnostic framework for uncertainty quantification with finite-sample validity guarantees, making it an attractive tool for constructing reliable prediction sets. However, existing approaches commonly…

Machine Learning · Statistics 2025-05-30 Eshant English, Christoph Lippert

Finite-Sample Convergence Bounds for Trust Region Policy Optimization in Mean-Field Games

We introduce Mean-Field Trust Region Policy Optimization (MF-TRPO), a novel algorithm designed to compute approximate Nash equilibria for ergodic Mean-Field Games (MFG) in finite state-action spaces. Building on the well-established…

Machine Learning · Statistics 2025-05-30 Antonio Ocello, Daniil Tiapkin, Lorenzo Mancini, Mathieu Laurière, Eric Moulines

A False Discovery Rate Control Method Using a Fully Connected Hidden Markov Random Field for Neuroimaging Data

False discovery rate (FDR) control methods are essential for voxel-wise multiple testing in neuroimaging data analysis, where hundreds of thousands or even millions of tests are conducted to detect brain regions associated with…

Machine Learning · Statistics 2025-05-30 Taehyo Kim, Qiran Jia, Mony J. de Leon, Hai Shu

A Refined Analysis of UCBVI

In this work, we provide a refined analysis of the UCBVI algorithm (Azar et al., 2017), improving both the bonus terms and the regret analysis. Additionally, we compare our version of UCBVI with both its original version and the…

Machine Learning · Statistics 2025-05-30 Simone Drago, Marco Mussi, Alberto Maria Metelli

Instance-dependent Convergence Theory for Diffusion Models

Score-based diffusion models have demonstrated outstanding empirical performance in machine learning and artificial intelligence, particularly in generating high-quality new samples from complex probability distributions. Improving the…

Machine Learning · Statistics 2025-05-30 Yuchen Jiao, Gen Li

JANET: Joint Adaptive predictioN-region Estimation for Time-series

Conformal prediction provides machine learning models with prediction sets that offer theoretical guarantees, but the underlying assumption of exchangeability limits its applicability to time series data. Furthermore, existing approaches…

Machine Learning · Statistics 2025-05-30 Eshant English, Eliot Wong-Toi, Matteo Fontana, Stephan Mandt, Padhraic Smyth, Christoph Lippert

Approximate Thompson Sampling for Learning Linear Quadratic Regulators with $O(\sqrt{T})$ Regret

We propose a novel Thompson sampling algorithm that learns linear quadratic regulators (LQR) with a Bayesian regret bound of $O(\sqrt{T})$. Our method leverages Langevin dynamics with a carefully designed preconditioner and incorporates a…

Machine Learning · Statistics 2025-05-30 Yeoneung Kim, Gihun Kim, Jiwhan Park, Insoon Yang

Improvement-Focused Causal Recourse (ICR)

Algorithmic recourse recommendations, such as Karimi et al.'s (2021) causal recourse (CR), inform stakeholders of how to act to revert unfavourable decisions. However, some actions lead to acceptance (i.e., revert the model's decision) but…

Machine Learning · Statistics 2025-05-30 Gunnar König, Timo Freiesleben, Moritz Grosse-Wentrup

Principled Out-of-Distribution Generalization via Simplicity

Modern foundation models exhibit remarkable out-of-distribution (OOD) generalization, solving tasks far beyond the support of their training data. However, the theoretical principles underpinning this phenomenon remain elusive. This paper…

Machine Learning · Statistics 2025-05-29 Jiawei Ge, Amanda Wang, Shange Tang, Chi Jin

Hypothesis Testing in Imaging Inverse Problems

This paper proposes a framework for semantic hypothesis testing tailored to imaging inverse problems. Modern imaging methods struggle to support hypothesis testing, a core component of the scientific method that is essential for the…

Machine Learning · Statistics 2025-05-29 Yiming Xi, Konstantinos Zygalakis, Marcelo Pereyra

Computing Optimal Transport Maps and Wasserstein Barycenters Using Conditional Normalizing Flows

We present a novel method for efficiently computing optimal transport maps and Wasserstein barycenters in high-dimensional spaces. Our approach uses conditional normalizing flows to approximate the input distributions as invertible…

Machine Learning · Statistics 2025-05-29 Gabriele Visentin, Patrick Cheridito

Individualised Counterfactual Examples Using Conformal Prediction Intervals

Counterfactual explanations for black-box models aim to provide insight into an algorithmic decision to its recipient. For a binary classification problem an individual counterfactual details which features might be changed for the model…

Machine Learning · Statistics 2025-05-29 James M. Adams, Gesine Reinert, Lukasz Szpruch, Carsten Maple, Andrew Elliott

Learning Curves of Stochastic Gradient Descent in Kernel Regression

This paper considers a canonical problem in kernel regression: how good are the model performances when it is trained by the popular online first-order algorithms, compared to the offline ones, such as ridge and ridgeless regression? In…

Machine Learning · Statistics 2025-05-29 Haihan Zhang, Weicheng Lin, Yuanshi Liu, Cong Fang