机器学习 — Scifaro

Super-Level-Set Regression: Conditional Quantiles via Volume Minimization

Constructing minimum-volume prediction regions that satisfy conditional coverage is a fundamental challenge in multivariate regression. Standard approaches rely on explicitly estimating the full conditional density and subsequently…

机器学习 · 统计学 2026-05-08 Sacha Braun , Michael I. Jordan , Francis Bach

When Does Trimming Help Conformal Prediction? A Retained-Law Diagnostic under Calibration Contamination

Trimming suspicious calibration points is a common response to contamination in conformal prediction. Its effect on clean-target coverage, however, is governed by the retained law induced by trimming, not by the contamination level alone.…

机器学习 · 统计学 2026-05-08 Congye Wang

Expressivity of Bi-Lipschitz Normalizing Flows: A Score-Based Diffusion Perspective

Many normalizing flow architectures impose regularity constraints, yet their distributional approximation properties are not fully characterized. We study the expressivity of bi-Lipschitz normalizing flows through the lens of score-based…

机器学习 · 统计学 2026-05-08 Meira Iske , Carola-Bibiane Schönlieb

Gaussian mixture models in Hilbert spaces via kernel methods

Modern datasets across many disciplines increasingly consist of time-evolving, potentially infinite-dimensional random objects, such as dynamic functional data, which are naturally modeled in Hilbert spaces. In these settings,…

机器学习 · 统计学 2026-05-08 Daniel López-Montero , Antonio Álvarez-López , Marcos Matabuena

TabCF: Distributional Control Function Estimation with Tabular Foundation Models

Instrumental variable (IV) and control function (CF) methods are powerful tools for causal effect estimation in the presence of unmeasured confounding, yet most existing approaches target only mean effects and/or demand substantial fitting…

机器学习 · 统计学 2026-05-08 Geping Chen , Chunlin Li , Tianzhong Yang , Zhengyuan Zhu , Jing Zhou

Towards Reliable LLM Evaluation: Correcting the Winner's Curse in Adaptive Benchmarking

Adaptive prompt and program search makes LLM evaluation selection-sensitive. Once benchmark items are reused inside tuning, the observed winner's score need not estimate the fresh-data performance of the full tune-then-deploy procedure. We…

机器学习 · 统计学 2026-05-08 Yang Xu , Jiefu Zhang , Haixiang Sun , Zihan Zhou , Tianyu Cao , Vaneet Aggarwal

Tuning Derivatives for Causal Fairness in Machine Learning

Artificial-intelligence systems are becoming ubiquitous in society, yet their predictions typically inherit biases with respect to protected attributes such as race, gender, or age. Classical fairness notions, most notably Statistical…

机器学习 · 统计学 2026-05-08 Filip Edström , Guilherme W. F. Barros , Tetiana Gorbach , Xavier de Luna

CITE: Anytime-Valid Statistical Inference in LLM Self-Consistency

Large language models often improve reasoning by sampling multiple outputs and aggregating their final answers, but precise and efficient control of error levels remains a challenging task. In particular, deciding when to stop sampling…

机器学习 · 统计学 2026-05-08 Hirofumi Ota , Naoto Iwase , Yuki Ichihara , Junpei Komiyama , Masaaki Imaizumi

Ratio-based Loss Functions

Algorithms in machine learning and AI do critically depend on at least three key components: (i) the risk function, which is the expectation of the loss function, (ii) the function space, which is often called the hypothesis space, and…

机器学习 · 统计学 2026-05-08 Lena Helgerth , Andreas Christmann

Transformers Provably Implement In-Context Reinforcement Learning with Policy Improvement

We investigate the ability of transformers to perform in-context reinforcement learning (ICRL), where a model must infer and execute learning algorithms from trajectory data without parameter updates. We show that a linear self-attention…

机器学习 · 统计学 2026-05-08 Haodong Liang , Lifeng Lai

Spectral Lens: Activation and Gradient Spectra as Diagnostics of LLM Optimization

Training loss and throughput can hide distinct internal representation in language-model training. To examine these hidden mechanics, we use spectral measurements as practical and operational diagnostics. Using a controlled family of…

机器学习 · 统计学 2026-05-08 Andy Zeyi Liu , Elliot Paquette , John Sous

Variational Smoothing and Inference for SDEs from Sparse Data with Dynamic Neural Flows

Stochastic differential equations (SDEs) provide a flexible framework for modeling temporal dynamics in partially observed systems. A central task is to calibrate such models from data, which requires inferring latent trajectories and…

机器学习 · 统计学 2026-05-08 Yu Wang , Arnab Ganguly

In-Context Positive-Unlabeled Learning

Positive-unlabeled (PU) learning addresses binary classification when only a set of labeled positives is available alongside a pool of unlabeled samples drawn from a mixture of positives and negatives. Existing PU methods typically require…

机器学习 · 统计学 2026-05-08 Siyan Liu , Yi Chang , Manli Cheng , Qinglong Tian , Pengfei Li

Relaxed Sparsest-Permutation Formulation for Causal Discovery at Scale

Despite the growing availability of large datasets, causal structure learning remains computationally prohibitive at scale. We revisit sparsest-permutation learning for linear structural equation models and show that exact Cholesky…

机器学习 · 统计学 2026-05-08 Sunmin Oh , Sang-Yun Oh , Gunwoong Park

Permutation-preserving Functions and Neural Vecchia Covariance Kernels

We introduce a novel framework for constructing scalable and flexible covariance kernels for Gaussian processes (GPs) by directly learning the covariance structure under a regression-type parameterization induced by Vecchia approximations,…

机器学习 · 统计学 2026-05-08 Jian Cao , Nian Liu , Ying Lin

Convexity in Disguise: A Theoretical Framework for Nonconvex Low-Rank Matrix Estimation

Nonconvex methods have emerged as a dominant approach for low-rank matrix estimation, a problem that arises widely in machine learning and AI for learning and representing high-dimensional data. Existing analyses for these methods often…

机器学习 · 统计学 2026-05-08 Chengyu Cui , Gongjun Xu

Estimating Implicit Regularization in Deep Learning

Deep learning systems are known to exhibit implicit regularization (alt. implicit bias), favoring simple solutions instead of merely minimizing the loss function. In some cases, we can analytically derive the implicit regularization --…

机器学习 · 统计学 2026-05-08 Joseph H. Rudoler , Kevin Tan , Giles Hooker , Konrad P. Kording

Forecasting Oncology Demand Trends with Boosting-Based Bayesian Conjugate Models

Accurate trend forecasting in healthcare time series is essential for planning and resource allocation. This paper proposes a Bayesian framework for predicting oncology demand trends, modeling weekly appointments as a Poisson process with a…

机器学习 · 统计学 2026-05-08 Ademir Batista dos Santos Neto , Tiago Alessandro Espinola Ferreira , Paulo Renato Alves Firmino

Maximizing Rollout Informativeness under a Fixed Budget: A Submodular View of Tree Search for Tool-Use Agentic Reinforcement Learning

We formalize Rollout Informativeness under a Fixed Budget (RIFB) as the expected non-vanishing policy-gradient mass that a tool-use rollout set injects into Group Relative Policy Optimization (GRPO). We prove that any budget-agnostic…

机器学习 · 统计学 2026-05-08 Yuelin Hu , Zhenbo Yu , Zhengxue Cheng , Wei Liu , Li Song

Dynamic Vine Copulas: Detecting and Quantifying Time-Varying Higher-Order Interactions

Time-varying dependence is often modeled with dynamic correlations or Gaussian graphical models, but multivariate systems can change through tail behavior, asymmetry, or conditional structure even when correlations are nearly stable. We…

机器学习 · 统计学 2026-05-08 Houman Safaai , Alessandro Marin Vargas