Machine Learning - Scifaro

Transfer Learning Across Fixed-Income Product Classes

We propose a framework for transfer learning of discount curves across different fixed-income product classes. Motivated by challenges in estimating discount curves from sparse or noisy data, we extend kernel ridge regression (KR) to a…

Machine Learning · Statistics 2026-01-14 Nicolas Camenzind, Damir Filipovic

Interactive and Hybrid Imitation Learning: Provably Beating Behavior Cloning

Imitation learning (IL) is a paradigm for learning sequential decision making policies from experts, leveraging offline demonstrations, interactive annotations, or both. Recent advances show that when annotation cost is tallied per…

Machine Learning · Statistics 2026-01-14 Yichen Li, Chicheng Zhang

Dual-Level Models for Physics-Informed Multi-Step Time Series Forecasting

This paper develops an approach for multi-step forecasting of dynamical systems by integrating probabilistic input forecasting with physics-informed output prediction. Accurate multi-step forecasting of time series systems is important for…

Machine Learning · Statistics 2026-01-13 Mahdi Nasiri, Johanna Kortelainen, Simo Särkkä

Nonparametric Kernel Clustering with Bandit Feedback

Clustering with bandit feedback refers to the problem of partitioning a set of items, where the clustering algorithm can sequentially query the items to receive noisy observations. The problem is formally posed as the task of partitioning…

Machine Learning · Statistics 2026-01-13 Victor Thuot, Sebastian Vogt, Debarghya Ghoshdastidar, Nicolas Verzelen

Position: Don't be Afraid of Over-Smoothing And Over-Squashing

Over-smoothing and over-squashing have been extensively studied in the literature on Graph Neural Networks (GNNs) over the past years. We challenge this prevailing focus in GNN research, arguing that these phenomena are less critical for…

Machine Learning · Statistics 2026-01-13 Niklas Kormann, Benjamin Doerr, Johannes F. Lutzeyer

Covariance-Driven Regression Trees: Reducing Overfitting in CART

Decision trees are powerful machine learning algorithms, widely used in fields such as economics and medicine for their simplicity and interpretability. However, decision trees such as CART are prone to overfitting, especially when grown…

Machine Learning · Statistics 2026-01-13 Likun Zhang, Wei Ma

Robust Mean Estimation under Quantization

We consider the problem of mean estimation under quantization and adversarial corruption. We construct multivariate robust estimators that are optimal up to logarithmic factors in two different settings. The first is a one-bit setting,…

Machine Learning · Statistics 2026-01-13 Pedro Abdalla, Junren Chen

Conditional Normalizing Flows for Forward and Backward Joint State and Parameter Estimation

Traditional filtering algorithms for state estimation -- such as classical Kalman filtering, unscented Kalman filtering, and particle filters -- show performance degradation when applied to nonlinear systems whose uncertainty follows…

Machine Learning · Statistics 2026-01-13 Luke S. Lagunowich, Guoxiang Grayson Tong, Daniele E. Schiavazzi

Match Made with Matrix Completion: Efficient Learning under Matching Interference

Matching markets face increasing needs to learn the matching qualities between demand and supply for effective design of matching policies. In practice, the matching rewards are high-dimensional due to the growing diversity of participants.…

Machine Learning · Statistics 2026-01-13 Zhiyuan Tang, Wanning Chen, Kan Xu

The Impact of Anisotropic Covariance Structure on the Training Dynamics and Generalization Error of Linear Networks

The success of deep neural networks largely depends on the statistical structure of the training data. While learning dynamics and generalization on isotropic data are well-established, the impact of pronounced anisotropy on these crucial…

Machine Learning · Statistics 2026-01-13 Taishi Watanabe, Ryo Karakida, Jun-nosuke Teramae

Dimension-reduced outcome-weighted learning for estimating individualized treatment regimes in observational studies

Individualized treatment regimes (ITRs) aim to improve clinical outcomes by assigning treatment based on patient-specific characteristics. However, existing methods often struggle with high-dimensional covariates, limiting accuracy,…

Machine Learning · Statistics 2026-01-13 Sungtaek Son, Eardi Lila, Kwun Chuen Gary Chan

Physics-informed Gaussian Process Regression in Solving Eigenvalue Problem of Linear Operators

Applying Physics-Informed Gaussian Process Regression to the eigenvalue problem $(\mathcal{L}-\lambda)u = 0$ poses a fundamental challenge, where the null source term results in a trivial predictive mean and a degenerate marginal…

Machine Learning · Statistics 2026-01-13 Tianming Bai, Jiannan Yang

Copula-Stein Discrepancy: A Generator-Based Stein Operator for Archimedean Dependence

Kernel Stein discrepancies (KSDs) are widely used for goodness-of-fit testing, but standard KSDs can be insensitive to higher-order dependence features such as tail dependence. We introduce the Copula-Stein Discrepancy (CSD), which defines…

Machine Learning · Statistics 2026-01-13 Agnideep Aich, Ashit Baran Aich

Predictive inference for time series: why is split conformal effective despite temporal dependence?

We consider the problem of uncertainty quantification for prediction in a time series: if we use past data to forecast the next time point, can we provide valid prediction intervals around our forecasts? To avoid placing distributional…

Machine Learning · Statistics 2026-01-13 Rina Foygel Barber, Ashwin Pananjady

A Kernel-based Stochastic Approximation Framework for Nonlinear Operator Learning

We develop a stochastic approximation framework for learning nonlinear operators between infinite-dimensional spaces utilizing general Mercer operator-valued kernels. Our framework encompasses two key classes: (i) compact kernels, which…

Machine Learning · Statistics 2026-01-13 Jia-Qi Yang, Lei Shi

Bag of Coins: A Statistical Probe into Neural Confidence Structures

Modern neural networks often produce miscalibrated confidence scores and struggle to detect out-of-distribution (OOD) inputs, while most existing methods post-process outputs without testing internal consistency. We introduce the…

Machine Learning · Statistics 2026-01-13 Agnideep Aich, Sameera Hewage, Md Monzur Murshed, Bruce Wade, Ashit Baran Aich

When Less Is More: Binary Feedback Can Outperform Ordinal Comparisons in Ranking Recovery

Paired comparison data, where users evaluate items in pairs, play a central role in ranking and preference learning tasks. While ordinal comparison data intuitively offer richer information than binary comparisons, this paper challenges…

Machine Learning · Statistics 2026-01-13 Shirong Xu, Jingnan Zhang, Junhui Wang

Stable Minima of ReLU Neural Networks Suffer from the Curse of Dimensionality: The Neural Shattering Phenomenon

We study the implicit bias of flatness / low (loss) curvature and its effects on generalization in two-layer overparameterized ReLU networks with multivariate inputs -- a problem well motivated by the minima stability and edge-of-stability…

Machine Learning · Statistics 2026-01-13 Tongtong Liang, Dan Qiao, Yu-Xiang Wang, Rahul Parhi

Convergence Rates of Constrained Expected Improvement

Constrained Bayesian optimization (CBO) methods have seen significant success in black-box optimization with constraints. One of the most commonly used CBO methods is the constrained expected improvement (CEI) algorithm. CEI is a natural…

Machine Learning · Statistics 2026-01-13 Haowei Wang, Jingyi Wang, Zhongxiang Dai, Nai-Yuan Chiang, Szu Hui Ng, Cosmin G. Petra

Simulation of Multivariate Extremes: a Wasserstein-Aitchison GAN approach

Economically responsible mitigation of multivariate extreme risks (such as extreme rainfall over large areas, large simultaneous variations in many stock prices, or widespread breakdowns in transportation systems) requires assessing the…

Machine Learning · Statistics 2026-01-13 Stéphane Lhaut, Holger Rootzén, Johan Segers