机器学习 — Scifaro

The Bayesian Method of Tensor Networks

Bayesian learning is a powerful learning framework which combines the external information of the data (background information) with the internal information (training data) in a logically consistent way in inference and prediction. By…

机器学习 · 统计学 2026-02-11 Erdong Guo , David Draper

Provably robust learning of regression neural networks using $\beta$-divergences

Regression neural networks (NNs) are most commonly trained by minimizing the mean squared prediction error, which is highly sensitive to outliers and data contamination. Existing robust training methods for regression NNs are often limited…

机器学习 · 统计学 2026-02-10 Abhik Ghosh , Suryasis Jana

Winner's Curse Drives False Promises in Data-Driven Decisions: A Case Study in Refugee Matching

A major challenge in data-driven decision-making is accurate policy evaluation-i.e., guaranteeing that a learned decision-making policy achieves the promised benefits. A popular strategy is model-based policy evaluation, which estimates a…

机器学习 · 统计学 2026-02-10 Hamsa Bastani , Osbert Bastani , Bryce McLaughlin

Cutting Through the Noise: On-the-fly Outlier Detection for Robust Training of Machine Learning Interatomic Potentials

The accuracy of machine learning interatomic potentials suffers from reference data that contains numerical noise. Often originating from unconverged or inconsistent electronic-structure calculations, this noise is challenging to identify.…

机器学习 · 统计学 2026-02-10 Terry C. W. Lam , Niamh O'Neill , Christoph Schran , Lars L. Schaaf

Amortising Inference and Meta-Learning Priors in Neural Networks

One of the core facets of Bayesianism is in the updating of prior beliefs in light of new evidence$\text{ -- }$so how can we maintain a Bayesian approach if we have no prior beliefs in the first place? This is one of the central challenges…

机器学习 · 统计学 2026-02-10 Tommy Rochussen , Vincent Fortuin

Schr\"odinger bridge problem via empirical risk minimization

We study the Schr\"odinger bridge problem when the endpoint distributions are available only through samples. Classical computational approaches estimate Schr\"odinger potentials via Sinkhorn iterations on empirical measures and then…

机器学习 · 统计学 2026-02-10 Denis Belomestny , Alexey Naumov , Nikita Puchkin , Denis Suchkov

A Statistical Framework for Alignment with Biased AI Feedback

Modern alignment pipelines are increasingly replacing expensive human preference labels with evaluations from large language models (LLM-as-Judge). However, AI labels can be systematically biased compared to high-quality human feedback…

机器学习 · 统计学 2026-02-10 Xintao Xia , Zhiqiu Xia , Linjun Zhang , Zhanrui Cai

Discrete Adjoint Schr\"odinger Bridge Sampler

Learning discrete neural samplers is challenging due to the lack of gradients and combinatorial complexity. While stochastic optimal control (SOC) and Schr\"odinger bridge (SB) provide principled solutions, efficient SOC solvers like…

机器学习 · 统计学 2026-02-10 Wei Guo , Yuchen Zhu , Xiaochen Du , Juno Nam , Yongxin Chen , Rafael Gómez-Bombarelli , Guan-Horng Liu , Molei Tao , Jaemoo Choi

Information Geometry of Absorbing Markov-Chain and Discriminative Random Walks

Discriminative Random Walks (DRWs) are a simple yet powerful tool for semi-supervised node classification, but their theoretical foundations remain fragmentary. We revisit DRWs through the lens of information geometry, treating the family…

机器学习 · 统计学 2026-02-10 Masanari Kimura

Graph-based Semi-Supervised Learning via Maximum Discrimination

Semi-supervised learning (SSL) addresses the critical challenge of training accurate models when labeled data is scarce but unlabeled data is abundant. Graph-based SSL (GSSL) has emerged as a popular framework that captures data structure…

机器学习 · 统计学 2026-02-10 Nadav Katz , Ariel Jaffe

Fast Model Selection and Stable Optimization for Softmax-Gated Multinomial-Logistic Mixture of Experts Models

Mixture-of-Experts (MoE) architectures combine specialized predictors through a learned gate and are effective across regression and classification, but for classification with softmax multinomial-logistic gating, rigorous guarantees for…

机器学习 · 统计学 2026-02-10 TrungKhang Tran , TrungTin Nguyen , Md Abul Bashar , Nhat Ho , Richi Nayak , Christopher Drovandi

BFTS: Thompson Sampling with Bayesian Additive Regression Trees

Contextual bandits are a core technology for personalized mobile health interventions, where decision-making requires adapting to complex, non-linear user behaviors. While Thompson Sampling (TS) is a preferred strategy for these problems,…

机器学习 · 统计学 2026-02-10 Ruizhe Deng , Bibhas Chakraborty , Ran Chen , Yan Shuo Tan

On Generation in Metric Spaces

We study generation in separable metric instance spaces. We extend the language generation framework from Kleinberg and Mullainathan [2024] beyond countable domains by defining novelty through metric separation and allowing asymmetric…

机器学习 · 统计学 2026-02-10 Jiaxun Li , Vinod Raman , Ambuj Tewari

Fast and Robust Likelihood-Guided Diffusion Posterior Sampling with Amortized Variational Inference

Zero-shot diffusion posterior sampling offers a flexible framework for inverse problems by accommodating arbitrary degradation operators at test time, but incurs high computational cost due to repeated likelihood-guided updates. In…

机器学习 · 统计学 2026-02-10 Léon Zheng , Thomas Hirtz , Yazid Janati , Eric Moulines

Deep networks learn to parse uniform-depth context-free languages from local statistics

Understanding how the structure of language can be learned from sentences alone is a central question in both cognitive science and machine learning. Studies of the internal representations of Large Language Models (LLMs) support their…

机器学习 · 统计学 2026-02-10 Jack T. Parley , Francesco Cagnetta , Matthieu Wyart

Near-Universal Multiplicative Updates for Nonnegative Einsum Factorization

Despite the ubiquity of multiway data across scientific domains, there are few user-friendly tools that fit tailored nonnegative tensor factorizations. Researchers may use gradient-based automatic differentiation (which often struggles in…

机器学习 · 统计学 2026-02-10 John Hood , Aaron Schein

Optimal Decision-Making Based on Prediction Sets

Prediction sets can wrap around any ML model to cover unknown test outcomes with a guaranteed probability. Yet, it remains unclear how to use them optimally for downstream decision-making. Here, we propose a decision-theoretic framework…

机器学习 · 统计学 2026-02-10 Tao Wang , Edgar Dobriban

Small Gradient Norm Regret for Online Convex Optimization

This paper introduces a new problem-dependent regret measure for online convex optimization with smooth losses. The notion, which we call the $G^\star$ regret, depends on the cumulative squared gradient norm evaluated at the decision in…

机器学习 · 统计学 2026-02-10 Wenzhi Gao , Chang He , Madeleine Udell

Local EGOP for Continuous Index Learning

We introduce the setting of continuous index learning, in which a function of many variables varies only along a small number of directions at each point. For efficient estimation, it is beneficial for a learning algorithm to adapt, near…

机器学习 · 统计学 2026-02-10 Alex Kokot , Anand Hemmady , Vydhourie Thiyageswaran , Marina Meila

Calibrated Multi-Level Quantile Forecasting

We develop an online method that guarantees calibration of quantile forecasts at multiple quantile levels simultaneously. In this work, a sequence of quantile forecasts is said to be calibrated provided that its $\alpha$-level predictions…

机器学习 · 统计学 2026-02-10 Tiffany Ding , Isaac Gibbs , Ryan J. Tibshirani