Machine Learning - Scifaro

Enhancing Gradient-based Discrete Sampling via Parallel Tempering

While gradient-based discrete samplers are effective in sampling from complex distributions, they are susceptible to getting trapped in local minima, particularly in high-dimensional, multimodal discrete distributions, owing to the…

Machine Learning · Statistics 2025-05-21 Luxu Liang, Yuhang Jia, Feng Zhou

Does Unsupervised Domain Adaptation Improve the Robustness of Amortized Bayesian Inference? A Systematic Evaluation

Neural networks are fragile when confronted with data that significantly deviates from their training distribution. This is true in particular for simulation-based inference methods, such as neural amortized Bayesian inference (ABI), where…

Machine Learning · Statistics 2025-05-21 Lasse Elsemüller, Valentin Pratz, Mischa von Krause, Andreas Voss, Paul-Christian Bürkner, Stefan T. Radev

CARROT: A Cost Aware Rate Optimal Router

With the rapid growth in the number of Large Language Models (LLMs), there has been a recent interest in LLM routing, or directing queries to the cheapest LLM that can deliver a suitable response. We conduct a minimax analysis of the…

Machine Learning · Statistics 2025-05-21 Seamus Somerstep, Felipe Maia Polo, Allysson Flavio Melo de Oliveira, Prattyush Mangal, Mírian Silva, Onkar Bhardwaj, Mikhail Yurochkin, Subha Maity

A new approach to locally adaptive polynomial regression

Adaptive bandwidth selection is a fundamental challenge in nonparametric regression. This paper introduces a new bandwidth selection procedure inspired by the optimality criteria for $\ell_0$-penalized regression. Although similar in spirit…

Machine Learning · Statistics 2025-05-21 Sabyasachi Chatterjee, Subhajit Goswami, Soumendu Sundar Mukherjee

Subspace Langevin Monte Carlo

Sampling from high-dimensional distributions has wide applications in data science and machine learning but poses significant computational challenges. We introduce Subspace Langevin Monte Carlo (SLMC), a novel and efficient sampling method…

Machine Learning · Statistics 2025-05-21 Tyler Maunu, Jiayi Yao

Nonlinear Meta-Learning Can Guarantee Faster Rates

Many recent theoretical works on meta-learning aim to achieve guarantees in leveraging similar representational structures from related tasks towards simplifying a target task. The main aim of theoretical guarantees on the subject is…

Machine Learning · Statistics 2025-05-21 Dimitri Meunier, Zhu Li, Arthur Gretton, Samory Kpotufe

Sequential Kernelized Independence Testing

Independence testing is a classical statistical problem that has been extensively studied in the batch setting when one fixes the sample size before collecting data. However, practitioners often prefer procedures that adapt to the…

Machine Learning · Statistics 2025-05-21 Aleksandr Podkopaev, Patrick Blöbaum, Shiva Prasad Kasiviswanathan, Aaditya Ramdas

Detection of Interacting Variables for Generalized Linear Models via Neural Networks

The quality of generalized linear models (GLMs), frequently used by insurance companies, depends on the choice of interacting variables. The search for interactions is time-consuming, especially for data sets with a large number of…

Machine Learning · Statistics 2025-05-21 Yevhen Havrylenko, Julia Heger

Scalable Importance Sampling in High Dimensions with Low-Rank Mixture Proposals

Importance sampling is a Monte Carlo technique for efficiently estimating the likelihood of rare events by biasing the sampling distribution towards the rare event of interest. By drawing weighted samples from a learned proposal…

Machine Learning · Statistics 2025-05-20 Liam A. Kruse, Marc R. Schlichting, Mykel J. Kochenderfer

From What Ifs to Insights: Counterfactuals in Causal Inference vs. Explainable AI

Counterfactuals play a pivotal role in the two distinct data science fields of causal inference (CI) and explainable artificial intelligence (XAI). While the core idea behind counterfactuals remains the same in both fields--the examination…

Machine Learning · Statistics 2025-05-20 Galit Shmueli, David Martens, Jaewon Yoo, Travis Greene

Smoothed SGD for quantiles: Bahadur representation and Gaussian approximation

This paper considers the estimation of quantiles via a smoothed version of the stochastic gradient descent (SGD) algorithm. By smoothing the score function in the conventional SGD quantile algorithm, we achieve monotonicity in the quantile…

Machine Learning · Statistics 2025-05-20 Likai Chen, Georg Keilbar, Wei Biao Wu

Causality-Inspired Robustness for Nonlinear Models via Representation Learning

Distributional robustness is a central goal of prediction algorithms due to the prevalent distribution shifts in real-world data. The prediction model aims to minimize the worst-case risk among a class of distributions, a.k.a., an…

Machine Learning · Statistics 2025-05-20 Marin Šola, Peter Bühlmann, Xinwei Shen

Multi-modal contrastive learning adapts to intrinsic dimensions of shared latent variables

Multi-modal contrastive learning as a self-supervised representation learning technique has achieved great success in foundation model training, such as CLIP (Radford et al., 2021). In this paper, we study the theoretical properties of…

Machine Learning · Statistics 2025-05-20 Yu Gui, Cong Ma, Zongming Ma

Wasserstein Barycenter Gaussian Process based Bayesian Optimization

Gaussian Process based Bayesian Optimization is a widely applied algorithm to learn and optimize under uncertainty, well-known for its sample efficiency. However, recently -- and more frequently -- research studies have empirically…

Machine Learning · Statistics 2025-05-20 Antonio Candelieri, Andrea Ponti, Francesco Archetti

High-Dimensional Dynamic Covariance Models with Random Forests

This paper introduces a novel nonparametric method for estimating high-dimensional dynamic covariance matrices with multiple conditioning covariates, leveraging random forests and supported by robust theoretical guarantees. Unlike…

Machine Learning · Statistics 2025-05-20 Shuguang Yu, Fan Zhou, Yingjie Zhang, Ziqi Chen, Hongtu Zhu

T-Rex: Fitting a Robust Factor Model via Expectation-Maximization

Over the past decades, there has been a surge of interest in studying low-dimensional structures within high-dimensional data. Statistical factor models (i.e., low-rank plus diagonal covariance structures) offer a powerful framework…

Machine Learning · Statistics 2025-05-20 Daniel Cederberg

Multi-Attribute Graph Estimation with Sparse-Group Non-Convex Penalties

We consider the problem of inferring the conditional independence graph (CIG) of high-dimensional Gaussian vectors from multi-attribute data. Most existing methods for graph estimation are based on single-attribute models where one…

Machine Learning · Statistics 2025-05-20 Jitendra K Tugnait

Humble your Overconfident Networks: Unlearning Overfitting via Sequential Monte Carlo Tempered Deep Ensembles

Sequential Monte Carlo (SMC) methods offer a principled approach to Bayesian uncertainty quantification but are traditionally limited by the need for full-batch gradient evaluations. We introduce a scalable variant by incorporating…

Machine Learning · Statistics 2025-05-20 Andrew Millard, Zheng Zhao, Joshua Murphy, Simon Maskell

Calibration Strategies for Robust Causal Estimation: Theoretical and Empirical Insights on Propensity Score-Based Estimators

The partitioning of data for estimation and calibration critically impacts the performance of propensity score based estimators like inverse probability weighting (IPW) and double/debiased machine learning (DML) frameworks. We extend recent…

Machine Learning · Statistics 2025-05-20 Sven Klaassen, Jan Rabenseifner, Jannis Kueck, Philipp Bach

Asymptotic Analysis of Two-Layer Neural Networks after One Gradient Step under Gaussian Mixtures Data with Structure

In this work, we study the training and generalization performance of two-layer neural networks (NNs) after one gradient descent step under structured data modeled by Gaussian mixtures. While previous research has extensively analyzed this…

Machine Learning · Statistics 2025-05-20 Samet Demir, Zafer Dogan