机器学习 — Scifaro

Structured Difference-of-Q via Orthogonal Learning

Offline reinforcement learning is important in many settings with available observational data but the inability to deploy new policies online due to safety, cost, and other concerns. Many recent advances in causal inference and machine…

机器学习 · 统计学 2025-07-08 Defu Cao , Angela Zhou

Statistical Significance of Feature Importance Rankings

Feature importance scores are ubiquitous tools for understanding the predictions of machine learning models. However, many popular attribution methods suffer from high instability due to random sampling. Leveraging novel ideas from…

机器学习 · 统计学 2025-07-08 Jeremy Goldwasser , Giles Hooker

Model-free Posterior Sampling via Learning Rate Randomization

In this paper, we introduce Randomized Q-learning (RandQL), a novel randomized model-free algorithm for regret minimization in episodic Markov Decision Processes (MDPs). To the best of our knowledge, RandQL is the first tractable model-free…

机器学习 · 统计学 2025-07-08 Daniil Tiapkin , Denis Belomestny , Daniele Calandriello , Eric Moulines , Remi Munos , Alexey Naumov , Pierre Perrault , Michal Valko , Pierre Menard

On the quality of randomized approximations of Tukey's depth

Tukey's depth (or halfspace depth) is a widely used measure of centrality for multivariate data. However, exact computation of Tukey's depth is known to be a hard problem in high dimensions. As a remedy, randomized approximations of Tukey's…

机器学习 · 统计学 2025-07-08 Simon Briend , Gábor Lugosi , Roberto Imbuzeiro Oliveira

The geometry of financial institutions -- Wasserstein clustering of financial data

The increasing availability of granular and big data on various objects of interest has made it necessary to develop methods for condensing this information into a representative and intelligible map. Financial regulation is a field that…

机器学习 · 统计学 2025-07-08 Lorenz Riess , Mathias Beiglböck , Johannes Temme , Andreas Wolf , Julio Backhoff

Learning from Similar Linear Representations: Adaptivity, Minimaxity, and Robustness

Representation multi-task learning (MTL) has achieved tremendous success in practice. However, the theoretical understanding of these methods is still lacking. Most existing theoretical works focus on cases where all tasks share the same…

机器学习 · 统计学 2025-07-08 Ye Tian , Yuqi Gu , Yang Feng

Reformulating van Rijsbergen's $F_{\beta}$ metric for weighted binary cross-entropy

The separation of performance metrics from gradient based loss functions may not always give optimal results and may miss vital aggregate information. This paper investigates incorporating a performance metric alongside differentiable loss…

机器学习 · 统计学 2025-07-08 Satesh Ramdhani

Deep Learning in current Neuroimaging: a multivariate approach with power and type I error control but arguable generalization ability

Discriminative analysis in neuroimaging by means of deep/machine learning techniques is usually tested with validation techniques, whereas the associated statistical significance remains largely under-developed due to their computational…

机器学习 · 统计学 2025-07-08 Carmen Jiménez-Mesa , Javier Ramírez , John Suckling , Jonathan Vöglein , Johannes Levin , Juan Manuel Górriz , Alzheimer's Disease Neuroimaging Initiative ADNI , Dominantly Inherited Alzheimer Network DIAN

Sparse Gaussian Processes: Structured Approximations and Power-EP Revisited

Inducing-point-based sparse variational Gaussian processes have become the standard workhorse for scaling up GP models. Recent advances show that these methods can be improved by introducing a diagonal scaling matrix to the conditional…

机器学习 · 统计学 2025-07-04 Thang D. Bui , Michalis K. Titsias

Transfer Learning for Matrix Completion

In this paper, we explore the knowledge transfer under the setting of matrix completion, which aims to enhance the estimation of a low-rank target matrix with auxiliary data available. We propose a transfer learning procedure given prior…

机器学习 · 统计学 2025-07-04 Dali Liu , Haolei Weng

Adaptive Iterative Soft-Thresholding Algorithm with the Median Absolute Deviation

The adaptive Iterative Soft-Thresholding Algorithm (ISTA) has been a popular algorithm for finding a desirable solution to the LASSO problem without explicitly tuning the regularization parameter $\lambda$. Despite that the adaptive ISTA is…

机器学习 · 统计学 2025-07-04 Yining Feng , Ivan Selesnick

Asymptotically perfect seeded graph matching without edge correlation (and applications to inference)

We present the OmniMatch algorithm for seeded multiple graph matching. In the setting of $d$-dimensional Random Dot Product Graphs (RDPG), we prove that under mild assumptions, OmniMatch with $s$ seeds asymptotically and efficiently…

机器学习 · 统计学 2025-07-04 Tong Qi , Vera Andersson , Peter Viechnicki , Vince Lyzinski

The Choice of Normalization Influences Shrinkage in Regularized Regression

Regularized models are often sensitive to the scales of the features in the data and it has therefore become standard practice to normalize (center and scale) the features before fitting the model. But there are many different ways to…

机器学习 · 统计学 2025-07-04 Johan Larsson , Jonas Wallin

Generalization vs. Specialization under Concept Shift

Machine learning models are often brittle under distribution shift, i.e., when data distributions at test time differ from those during training. Understanding this failure mode is central to identifying and mitigating safety risks of mass…

机器学习 · 统计学 2025-07-04 Alex Nguyen , David J. Schwab , Vudtiwat Ngampruetikorn

A new perspective on Bayesian Operational Modal Analysis

In the field of operational modal analysis (OMA), obtained modal information is frequently used to assess the current state of aerospace, mechanical, offshore and civil structures. However, the stochasticity of operational systems and the…

机器学习 · 统计学 2025-07-04 Brandon J. O'Connell , Max D. Champneys , Timothy J. Rogers

Distribution Matching for Self-Supervised Transfer Learning

In this paper, we propose a novel self-supervised transfer learning method called \underline{\textbf{D}}istribution \underline{\textbf{M}}atching (DM), which drives the representation distribution toward a predefined reference distribution…

机器学习 · 统计学 2025-07-03 Yuling Jiao , Wensen Ma , Defeng Sun , Hansheng Wang , Yang Wang

Long-Context Linear System Identification

This paper addresses the problem of long-context linear system identification, where the state $x_t$ of a dynamical system at time $t$ depends linearly on previous states $x_s$ over a fixed context window of length $p$. We establish a…

机器学习 · 统计学 2025-07-03 Oğuz Kaan Yüksel , Mathieu Even , Nicolas Flammarion

Is merging worth it? Securely evaluating the information gain for causal dataset acquisition

Merging datasets across institutions is a lengthy and costly procedure, especially when it involves private information. Data hosts may therefore want to prospectively gauge which datasets are most beneficial to merge with, without…

机器学习 · 统计学 2025-07-03 Jake Fawkes , Lucile Ter-Minassian , Desi Ivanova , Uri Shalit , Chris Holmes

A Two-Scale Complexity Measure for Deep Learning Models

We introduce a novel capacity measure 2sED for statistical models based on the effective dimension. The new quantity provably bounds the generalization error under mild assumptions on the model. Furthermore, simulations on standard data…

机器学习 · 统计学 2025-07-03 Massimiliano Datres , Gian Paolo Leonardi , Alessio Figalli , David Sutter

Upper and lower bounds for the Lipschitz constant of random neural networks

Empirical studies have widely demonstrated that neural networks are highly sensitive to small, adversarial perturbations of the input. The worst-case robustness against these so-called adversarial examples can be quantified by the Lipschitz…

机器学习 · 统计学 2025-07-03 Paul Geuchen , Dominik Stöger , Thomas Telaar , Felix Voigtlaender