机器学习 — Scifaro

Overcoming Dependent Censoring in the Evaluation of Survival Models

Conventional survival metrics, such as Harrell's concordance index (CI) and the Brier Score, rely on the independent censoring assumption for valid inference with right-censored data. However, in the presence of so-called dependent…

机器学习 · 统计学 2025-05-20 Christian Marius Lillelund , Shi-ang Qi , Russell Greiner

Conformal Prediction under Levy-Prokhorov Distribution Shifts: Robustness to Local and Global Perturbations

Conformal prediction provides a powerful framework for constructing prediction intervals with finite-sample guarantees, yet its robustness under distribution shifts remains a significant challenge. This paper addresses this limitation by…

机器学习 · 统计学 2025-05-20 Liviu Aolaritei , Zheyu Oliver Wang , Julie Zhu , Michael I. Jordan , Youssef Marzouk

Optimal Downsampling for Imbalanced Classification with Generalized Linear Models

Downsampling or under-sampling is a technique that is utilized in the context of large and highly imbalanced classification models. We study optimal downsampling for imbalanced classification using generalized linear models (GLMs). We…

机器学习 · 统计学 2025-05-20 Yan Chen , Jose Blanchet , Krzysztof Dembczynski , Laura Fee Nern , Aaron Flores

Spectral complexity of deep neural networks

It is well-known that randomly initialized, push-forward, fully-connected neural networks weakly converge to isotropic Gaussian processes, in the limit where the width of all layers goes to infinity. In this paper, we propose to use the…

机器学习 · 统计学 2025-05-20 Simmaco Di Lillo , Domenico Marinucci , Michele Salvi , Stefano Vigogna

Convergence Properties of Stochastic Hypergradients

Bilevel optimization problems are receiving increasing attention in machine learning as they provide a natural framework for hyperparameter optimization and meta-learning. A key step to tackle these problems is the efficient computation of…

机器学习 · 统计学 2025-05-20 Riccardo Grazzi , Massimiliano Pontil , Saverio Salzo

Adaptive Linear Embedding for Nonstationary High-Dimensional Optimization

Bayesian Optimization (BO) in high-dimensional spaces remains fundamentally limited by the curse of dimensionality and the rigidity of global low-dimensional assumptions. While Random EMbedding Bayesian Optimization (REMBO) mitigates this…

机器学习 · 统计学 2025-05-19 Yuejiang Wen , Paul D. Franzon

A Fourier Space Perspective on Diffusion Models

Diffusion models are state-of-the-art generative models on data modalities such as images, audio, proteins and materials. These modalities share the property of exponentially decaying variance and magnitude in the Fourier domain. Under the…

机器学习 · 统计学 2025-05-19 Fabian Falck , Teodora Pandeva , Kiarash Zahirnia , Rachel Lawrence , Richard Turner , Edward Meeds , Javier Zazo , Sushrut Karmalkar

On Next-Token Prediction in LLMs: How End Goals Determine the Consistency of Decoding Algorithms

Probabilistic next-token prediction trained using cross-entropy loss is the basis of most large language models. Given a sequence of previous values, next-token prediction assigns a probability to each possible next value in the vocabulary.…

机器学习 · 统计学 2025-05-19 Jacob Trauger , Ambuj Tewari

An Exponential Averaging Process with Strong Convergence Properties

Averaging, or smoothing, is a fundamental approach to obtain stable, de-noised estimates from noisy observations. In certain scenarios, observations made along trajectories of random dynamical systems are of particular interest. One popular…

机器学习 · 统计学 2025-05-19 Frederik Köhne , Anton Schiela

Positional Encoder Graph Quantile Neural Networks for Geographic Data

Positional Encoder Graph Neural Networks (PE-GNNs) are among the most effective models for learning from continuous spatial data. However, their predictive distributions are often poorly calibrated, limiting their utility in applications…

机器学习 · 统计学 2025-05-19 William E. R. de Amorim , Scott A. Sisson , T. Rodrigues , David J. Nott , Guilherme S. Rodrigues

A replica analysis of under-bagging

Under-bagging (UB), which combines under-sampling and bagging, is a popular ensemble learning method for training classifiers on an imbalanced data. Using bagging to reduce the increased variance caused by the reduction in sample size due…

机器学习 · 统计学 2025-05-19 Takashi Takahashi

On the Nonconvexity of Push-Forward Constraints and Its Consequences in Machine Learning

The push-forward operation enables one to redistribute a probability measure through a deterministic map. It plays a key role in statistics and optimization: many learning problems (notably from optimal transport, generative modeling, and…

机器学习 · 统计学 2025-05-19 Lucas de Lara , Mathis Deronzier , Alberto González-Sanz , Virgile Foy

Changing the Kernel During Training Leads to Double Descent in Kernel Regression

We investigate changing the bandwidth of a translational-invariant kernel during training when solving kernel regression with gradient descent. We present a theoretical bound on the out-of-sample generalization error that advocates for…

机器学习 · 统计学 2025-05-19 Oskar Allerbo

Auditing Fairness by Betting

We provide practical, efficient, and nonparametric methods for auditing the fairness of deployed classification and regression models. Whereas previous work relies on a fixed-sample size, our methods are sequential and allow for the…

机器学习 · 统计学 2025-05-19 Ben Chugg , Santiago Cortes-Gomez , Bryan Wilder , Aaditya Ramdas

Welfare Analysis in Dynamic Models

This paper introduces metrics for welfare analysis in dynamic models. We develop estimation and inference for these parameters even in the presence of a high-dimensional state space. Examples of welfare metrics include average welfare,…

机器学习 · 统计学 2025-05-19 Victor Chernozhukov , Whitney Newey , Vira Semenova

FlowVAT: Normalizing Flow Variational Inference with Affine-Invariant Tempering

Multi-modal and high-dimensional posteriors present significant challenges for variational inference, causing mode-seeking behavior and collapse despite the theoretical expressiveness of normalizing flows. Traditional annealing methods…

机器学习 · 统计学 2025-05-16 Juehang Qin , Shixiao Liang , Christopher Tunnell

Efficient MCMC Sampling with Expensive-to-Compute and Irregular Likelihoods

Bayesian inference with Markov Chain Monte Carlo (MCMC) is challenging when the likelihood function is irregular and expensive to compute. We explore several sampling algorithms that make use of subset evaluations to reduce computational…

机器学习 · 统计学 2025-05-16 Conor Rosato , Harvinder Lehal , Simon Maskell , Lee Devlin , Malcolm Strens

A Scalable Gradient-Based Optimization Framework for Sparse Minimum-Variance Portfolio Selection

Portfolio optimization involves selecting asset weights to minimize a risk-reward objective, such as the portfolio variance in the classical minimum-variance framework. Sparse portfolio selection extends this by imposing a cardinality…

机器学习 · 统计学 2025-05-16 Sarat Moka , Matias Quiroz , Vali Asimit , Samuel Muller

Learning Multi-Attribute Differential Graphs with Non-Convex Penalties

We consider the problem of estimating differences in two multi-attribute Gaussian graphical models (GGMs) which are known to have similar structure, using a penalized D-trace loss function with non-convex penalties. The GGM structure is…

机器学习 · 统计学 2025-05-16 Jitendra K Tugnait

Penalized Overdamped and Underdamped Langevin Monte Carlo Algorithms for Constrained Sampling

We consider the constrained sampling problem where the goal is to sample from a target distribution $\pi(x)\propto e^{-f(x)}$ when $x$ is constrained to lie on a convex body $\mathcal{C}$. Motivated by penalty methods from continuous…

机器学习 · 统计学 2025-05-16 Mert Gürbüzbalaban , Yuanhan Hu , Lingjiong Zhu