Machine Learning - Scifaro

Quantifying Normality: Convergence Rate to Gaussian Limit for Stochastic Approximation and Unadjusted OU Algorithm

Stochastic approximation (SA) is a method for finding the root of an operator perturbed by noise. There is a rich literature establishing the asymptotic normality of rescaled SA iterates under fairly mild conditions. However, these…

Machine Learning · Statistics 2026-02-17 Shaan Ul Haque, Zedong Wang, Zixuan Zhang, Siva Theja Maguluri

Locally Private Parametric Methods for Change-Point Detection

We study parametric change-point detection, where the goal is to identify distributional changes in time series, under local differential privacy. In the non-private setting, we derive improved finite-sample accuracy guarantees for a…

Machine Learning · Statistics 2026-02-17 Anuj Kumar Yadav, Cemre Cadir, Yanina Shkel, Michael Gastpar

Metabolic cost of information processing in Poisson variational autoencoders

Computation in biological systems is fundamentally energy-constrained, yet standard theories of computation treat energy as freely available. Here, we argue that variational free energy minimization under a Poisson assumption offers a…

Machine Learning · Statistics 2026-02-17 Hadi Vafaii, Jacob L. Yates

Nonparametric Distribution Regression Re-calibration

A key challenge in probabilistic regression is ensuring that predictive distributions accurately reflect true empirical uncertainty. Minimizing overall prediction error often encourages models to prioritize informativeness over calibration,…

Machine Learning · Statistics 2026-02-17 Ádám Jung, Domokos M. Kelen, András A. Benczúr

Discrete Adjoint Matching

Computation methods for solving entropy-regularized reward optimization -- a class of problems widely used for fine-tuning generative models -- have advanced rapidly. Among those, Adjoint Matching (AM, Domingo-Enrich et al., 2025) has…

Machine Learning · Statistics 2026-02-17 Oswin So, Brian Karrer, Chuchu Fan, Ricky T. Q. Chen, Guan-Horng Liu

Optimal Learning-Rate Schedules under Functional Scaling Laws: Power Decay and Warmup-Stable-Decay

We study optimal learning-rate schedules (LRSs) under the functional scaling law (FSL) framework introduced in Li et al. (2025), which accurately models the loss dynamics of both linear regression and large language model (LLM)…

Machine Learning · Statistics 2026-02-17 Binghui Li, Zilin Wang, Fengling Chen, Shiyang Zhao, Ruiheng Zheng, Lei Wu

High-Dimensional Limit of Stochastic Gradient Flow via Dynamical Mean-Field Theory

Modern machine learning models are typically trained via multi-pass stochastic gradient descent (SGD) with small batch sizes, and understanding their dynamics in high dimensions is of great interest. However, an analytical framework for…

Machine Learning · Statistics 2026-02-17 Sota Nishiyama, Masaaki Imaizumi

Robust Generalization with Adaptive Optimal Transport Priors for Decision-Focused Learning

Few-shot learning requires models to generalize under limited supervision while remaining robust to distribution shifts. Existing Sinkhorn Distributionally Robust Optimization (DRO) methods provide theoretical guarantees but rely on a fixed…

Machine Learning · Statistics 2026-02-17 Haixiang Sun, Andrew L. Liu

A Decomposable Forward Process in Diffusion Models for Time-Series Forecasting

We introduce a model-agnostic forward diffusion process for time-series forecasting that decomposes signals into spectral components, preserving structured temporal patterns such as seasonality more effectively than standard diffusion.…

Machine Learning · Statistics 2026-02-17 Francisco Caldas, Sahil Kumar, Cláudia Soares

Optimization and Regularization Under Arbitrary Objectives

This study investigates the limitations of applying Markov Chain Monte Carlo (MCMC) methods to arbitrary objective functions, focusing on a two-block MCMC framework which alternates between Metropolis-Hastings and Gibbs sampling. While such…

Machine Learning · Statistics 2026-02-17 Jared N. Lakhani, Etienne Pienaar

Robust Bayesian Optimisation with Unbounded Corruptions

Bayesian Optimization is critically vulnerable to extreme outliers. Existing provably robust methods typically assume a bounded cumulative corruption budget, which makes them defenseless against even a single corruption of sufficient…

Machine Learning · Statistics 2026-02-17 Abdelhamid Ezzerg, Ilija Bogunovic, Jeremias Knoblauch

Bias-Corrected Data Synthesis for Imbalanced Learning

Imbalanced data, where the positive samples represent only a small proportion compared to the negative samples, makes it challenging for classification problems to balance the false positive and false negative rates. A common approach to…

Machine Learning · Statistics 2026-02-17 Pengfei Lyu, Zhengchi Ma, Linjun Zhang, Anru R. Zhang

Learning under Quantization for High-Dimensional Linear Regression

The use of low-bit quantization has emerged as an indispensable technique for enabling the efficient training of large-scale models. Despite its widespread empirical success, a rigorous theoretical understanding of its impact on learning…

Machine Learning · Statistics 2026-02-17 Dechen Zhang, Junwei Su, Difan Zou

Differentially Private Two-Stage Gradient Descent for Instrumental Variable Regression

We study instrumental variable regression (IVaR) under differential privacy constraints. Classical IVaR methods (like two-stage least squares regression) rely on solving moment equations that directly use sensitive covariates and…

Machine Learning · Statistics 2026-02-17 Haodong Liang, Yanhao Jin, Krishnakumar Balasubramanian, Lifeng Lai

The Statistical Fairness-Accuracy Frontier

We study fairness-accuracy tradeoffs when a single predictive model must serve multiple demographic groups. A useful tool for understanding this tradeoff is the fairness-accuracy (FA) Pareto frontier, which characterizes the set of models…

Machine Learning · Statistics 2026-02-17 Alireza Fallah, Michael I. Jordan, Annie Ulichney

The Majority Vote Paradigm Shift: When Popular Meets Optimal

Reliably labelling data typically requires annotations from multiple human workers. However, humans are far from being perfect. Hence, it is a common practice to aggregate labels gathered from multiple annotators to make a more confident…

Machine Learning · Statistics 2026-02-17 Antonio Purificato, Maria Sofia Bucarelli, Anil Kumar Nelakanti, Andrea Bacciu, Fabrizio Silvestri, Amin Mantrach

Guaranteed Nonconvex Low-Rank Tensor Estimation via Scaled Gradient Descent

Tensors, which give a faithful and effective representation to deliver the intrinsic structure of multi-dimensional data, play a crucial role in an increasing number of signal processing and machine learning problems. However, tensor data…

Machine Learning · Statistics 2026-02-17 Tong Wu

Denoising Diffusions with Optimal Transport: Localization, Curvature, and Multi-Scale Complexity

Adding noise is easy; what about denoising? Diffusion is easy; what about reverting a diffusion? Diffusion-based generative models aim to denoise a Langevin diffusion chain, moving from a log-concave equilibrium measure $\nu$, say an…

Machine Learning · Statistics 2026-02-17 Tengyuan Liang, Kulunu Dharmakeerthi, Takuya Koriyama

Deep Two-Way Matrix Reordering for Relational Data Analysis

Matrix reordering is a task to permute the rows and columns of a given observed matrix such that the resulting reordered matrix shows meaningful or interpretable structural patterns. Most existing matrix reordering techniques share the…

Machine Learning · Statistics 2026-02-17 Chihiro Watanabe, Taiji Suzuki

AdaGrad-Diff: A New Version of the Adaptive Gradient Algorithm

Vanilla gradient methods are often highly sensitive to the choice of stepsize, which typically requires manual tuning. Adaptive methods alleviate this issue and have therefore become widely used. Among them, AdaGrad has been particularly…

Machine Learning · Statistics 2026-02-16 Matia Bojovic, Saverio Salzo, Massimiliano Pontil