机器学习 — Scifaro

Learning an Optimal Assortment Policy under Observational Data

We study the fundamental problem of offline assortment optimization under the Multinomial Logit (MNL) model, where sellers must determine the optimal subset of the products to offer based solely on historical customer choice data. While…

机器学习 · 统计学 2025-08-26 Yuxuan Han , Han Zhong , Miao Lu , Jose Blanchet , Zhengyuan Zhou

Poisson Hierarchical Indian Buffet Processes-With Indications for Microbiome Species Sampling Models

We introduce the Poisson Hierarchical Indian Buffet Process (PHIBP), a new class of species sampling models designed to address the challenges of complex, sparse count data by facilitating information sharing across and within groups. Our…

机器学习 · 统计学 2025-08-26 Lancelot F. James , Juho Lee , Abhinav Pandey

Adversarial Robustness in Two-Stage Learning-to-Defer: Algorithms and Guarantees

Two-stage Learning-to-Defer (L2D) enables optimal task delegation by assigning each input to either a fixed main model or one of several offline experts, supporting reliable decision-making in complex, multi-agent environments. However,…

机器学习 · 统计学 2025-08-26 Yannis Montreuil , Axel Carlier , Lai Xing Ng , Wei Tsang Ooi

Learning from Summarized Data: Gaussian Process Regression with Sample Quasi-Likelihood

Gaussian process regression is a powerful Bayesian nonlinear regression method. Recent research has enabled the capture of many types of observations using non-Gaussian likelihoods. To deal with various tasks in spatial modeling, we benefit…

机器学习 · 统计学 2025-08-26 Yuta Shikuri

Fitting Multilevel Factor Models

We examine a special case of the multilevel factor model, with covariance given by multilevel low rank (MLR) matrix~\cite{parshakova2023factor}. We develop a novel, fast implementation of the expectation-maximization algorithm, tailored for…

机器学习 · 统计学 2025-08-26 Tetiana Parshakova , Trevor Hastie , Stephen Boyd

On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching Regularization

Accurately aligning large language models (LLMs) with human preferences is crucial for informing fair, economically sound, and statistically efficient decision-making processes. However, we argue that the predominant approach for aligning…

机器学习 · 统计学 2025-08-26 Jiancong Xiao , Ziniu Li , Xingyu Xie , Emily Getzen , Cong Fang , Qi Long , Weijie J. Su

Simulation Based Bayesian Optimization

Bayesian Optimization (BO) is a powerful method for optimizing black-box functions by combining prior knowledge with ongoing function evaluations. BO constructs a probabilistic surrogate model of the objective function given the covariates,…

机器学习 · 统计学 2025-08-26 Roi Naveiro , Becky Tang

Conditional Stochastic Interpolation for Generative Learning

We propose a conditional stochastic interpolation (CSI) method for learning conditional distributions. CSI is based on estimating probability flow equations or stochastic differential equations that transport a reference distribution to the…

机器学习 · 统计学 2025-08-26 Ding Huang , Jian Huang , Ting Li , Guohao Shen

Underdamped Langevin MCMC with third order convergence

In this paper, we propose a new numerical method for the underdamped Langevin diffusion (ULD) and present a non-asymptotic analysis of its sampling error in the 2-Wasserstein distance when the $d$-dimensional target distribution…

机器学习 · 统计学 2025-08-25 Maximilian Scott , Dáire O'Kane , Andraž Jelinčič , James Foster

Deep Intrinsic Coregionalization Multi-Output Gaussian Process Surrogate with Active Learning

Deep Gaussian Processes (DGPs) are powerful surrogate models known for their flexibility and ability to capture complex functions. However, extending them to multi-output settings remains challenging due to the need for efficient dependency…

机器学习 · 统计学 2025-08-25 Chun-Yi Chang , Chih-Li Sung

A Sharp KL-Convergence Analysis for Diffusion Models under Minimal Assumptions

Diffusion-based generative models have emerged as highly effective methods for synthesizing high-quality samples. Recent works have focused on analyzing the convergence of their generation process with minimal assumptions, either through…

机器学习 · 统计学 2025-08-25 Nishant Jain , Tong Zhang

Interpretable Kernels

The use of kernels for nonlinear prediction is widespread in machine learning. They have been popularized in support vector machines and used in kernel ridge regression, amongst others. Kernel methods share three aspects. First, instead of…

机器学习 · 统计学 2025-08-25 Patrick J. F. Groenen , Michael Greenacre

General and Estimable Learning Bound Unifying Covariate and Concept Shifts

Generalization under distribution shift remains a core challenge in modern machine learning, yet existing learning bound theory is limited to narrow, idealized settings and is non-estimable from samples. In this paper, we bridge the gap…

机器学习 · 统计学 2025-08-25 Hongbo Chen , Li Charlie Xia

Generative diffusion posterior sampling for informative likelihoods

Sequential Monte Carlo (SMC) methods have recently shown successful results for conditional sampling of generative diffusion models. In this paper we propose a new diffusion posterior SMC sampler achieving improved statistical efficiencies,…

机器学习 · 统计学 2025-08-25 Zheng Zhao

A distance for mixed-variable and hierarchical domains with meta variables

Heterogeneous datasets emerge in various machine learning and optimization applications that feature different input sources, types or formats. Most models or methods do not natively tackle heterogeneity. Hence, such datasets are often…

机器学习 · 统计学 2025-08-25 Edward Hallé-Hannan , Charles Audet , Youssef Diouane , Sébastien Le Digabel , Paul Saves

Gaussian Process Inference Using Mini-batch Stochastic Gradient Descent: Convergence Guarantees and Empirical Benefits

Stochastic gradient descent (SGD) and its variants have established themselves as the go-to algorithms for large-scale machine learning problems with independent samples due to their generalization performance and intrinsic computational…

机器学习 · 统计学 2025-08-25 Hao Chen , Lili Zheng , Raed Al Kontar , Garvesh Raskutti

Tree-like Pairwise Interaction Networks

Modeling feature interactions in tabular data remains a key challenge in predictive modeling, for example, as used for insurance pricing. This paper proposes the Tree-like Pairwise Interaction Network (PIN), a novel neural network…

机器学习 · 统计学 2025-08-22 Ronald Richman , Salvatore Scognamiglio , Mario V. Wüthrich

Bayesian Optimization with Expected Improvement: No Regret and the Choice of Incumbent

Expected improvement (EI) is one of the most widely used acquisition functions in Bayesian optimization (BO). Despite its proven empirical success in applications, the cumulative regret upper bound of EI remains an open question. In this…

机器学习 · 统计学 2025-08-22 Jingyi Wang , Haowei Wang , Szu Hui Ng , Cosmin G. Petra

Bayesian Inference and Learning in Nonlinear Dynamical Systems: A Framework for Incorporating Explicit and Implicit Prior Knowledge

Accuracy and generalization capabilities are key objectives when learning dynamical system models. To obtain such models from limited data, current works exploit prior knowledge and assumptions about the system. However, the fusion of…

机器学习 · 统计学 2025-08-22 Björn Volkmann , Jan-Hendrik Ewering , Michael Meindl , Simon F. G. Ehlers , Thomas Seel

Kernel-based Equalized Odds: A Quantification of Accuracy-Fairness Trade-off in Fair Representation Learning

This paper introduces a novel kernel-based formulation of the Equalized Odds (EO) criterion, denoted as $EO_k$, for fair representation learning (FRL) in supervised settings. The central goal of FRL is to mitigate discrimination regarding a…

机器学习 · 统计学 2025-08-22 Yijin Ni , Xiaoming Huo