Machine Learning - Scifaro

OBLR-PO: A Theoretical Framework for Stable Reinforcement Learning

Existing reinforcement learning (RL)-based post-training methods for large language models have advanced rapidly, yet their design has largely been guided by heuristics rather than systematic theoretical principles. This gap limits our…

Machine Learning · Statistics 2026-01-16 Zixun Huang, Jiayi Sheng, Zeyu Zheng

Relative Information Gain and Gaussian Process Regression

The sample complexity of estimating or maximising an unknown function in a reproducing kernel Hilbert space is known to be linked to both the effective dimension and the information gain associated with the kernel. While the information…

Machine Learning · Statistics 2026-01-16 Hamish Flynn

Effects of Structural Allocation of Geometric Task Diversity in Linear Meta-Learning Models

Meta-learning aims to leverage information across related tasks to improve prediction on unlabeled data for new tasks when only a small number of labeled observations are available ("few-shot" learning). Increased task diversity is often…

Machine Learning · Statistics 2026-01-16 Saptati Datta, Nicolas W. Hengartner, Yulia Pimonova, Natalie E. Klein, Nicholas Lubbers

Data-Driven Dynamic Factor Modeling via Manifold Learning

We introduce a data-driven dynamic factor framework for modeling the joint evolution of high-dimensional covariates and responses without parametric assumptions. Standard factor models applied to covariates alone often lose explanatory…

Machine Learning · Statistics 2026-01-16 Graeme Baker, Agostino Capponi, J. Antonio Sidaoui

Autoencoding Random Forests

We propose a principled method for autoencoding with random forests. Our strategy builds on foundational results from nonparametric statistics and spectral graph theory to learn a low-dimensional embedding of the model that optimally…

Machine Learning · Statistics 2026-01-16 Binh Duc Vu, Jan Kapar, Marvin Wright, David S. Watson

Sparse Nonparametric Contextual Bandits

We study the benefits of sparsity in nonparametric contextual bandit problems, in which the set of candidate features is countably or uncountably infinite. Our contribution is two-fold. First, using a novel reduction to sequences of…

Machine Learning · Statistics 2026-01-16 Hamish Flynn, Julia Olkhovskaya, Paul Rognon-Vael

Exploring specialization and sensitivity of convolutional neural networks in the context of simultaneous image augmentations

Drawing parallels with the way biological networks are studied, we adapt the treatment–control paradigm to explainable artificial intelligence research and enrich it through multi-parametric input alterations. In this study, we propose a…

Machine Learning · Statistics 2026-01-16 Pavel Kharyuk, Sergey Matveev, Ivan Oseledets

Horseshoe Mixtures-of-Experts (HS-MoE)

Horseshoe mixtures-of-experts (HS-MoE) models provide a Bayesian framework for sparse expert selection in mixture-of-experts architectures. We combine the horseshoe prior's adaptive global-local shrinkage with input-dependent gating,…

Machine Learning · Statistics 2026-01-15 Nick Polson, Vadim Sokolov

Tail-Sensitive KL and Rényi Convergence of Unadjusted Hamiltonian Monte Carlo via One-Shot Couplings

Hamiltonian Monte Carlo (HMC) algorithms are among the most widely used sampling methods in high dimensional settings, yet their convergence properties are poorly understood in divergences that quantify relative density mismatch, such as…

Machine Learning · Statistics 2026-01-15 Nawaf Bou-Rabee, Siddharth Mitra, Andre Wibisono

Coupling Generative Modeling and an Autoencoder with the Causal Bridge

We consider inferring the causal effect of a treatment (intervention) on an outcome of interest in situations where there is potentially an unobserved confounder influencing both the treatment and the outcome. This is achievable by assuming…

Machine Learning · Statistics 2026-01-15 Ruolin Meng, Ming-Yu Chung, Dhanajit Brahma, Ricardo Henao, Lawrence Carin

Uncertainty-Aware PCA for Arbitrarily Distributed Data Modeled by Gaussian Mixture Models

Multidimensional data is often associated with uncertainties that are not well-described by normal distributions. In this work, we describe how such distributions can be projected to a low-dimensional space using uncertainty-aware principal…

Machine Learning · Statistics 2026-01-15 Daniel Klötzl, Ozan Tastekin, David Hägele, Marina Evers, Daniel Weiskopf

Trustworthy scientific inference with generative models

Generative artificial intelligence (AI) excels at producing complex data structures (text, images, videos) by learning patterns from training examples. Across scientific disciplines, researchers are now applying generative models to…

Machine Learning · Statistics 2026-01-15 James Carzon, Luca Masserano, Joshua D. Ingram, Alex Shen, Antonio Carlos Herling Ribeiro Junior, Tommaso Dorigo, Michele Doro, Joshua S. Speagle, Rafael Izbicki, Ann B. Lee

Multilevel neural simulation-based inference

Neural simulation-based inference (SBI) is a popular set of methods for Bayesian inference when models are only available in the form of a simulator. These methods are widely used in the sciences and engineering, where writing down a…

Machine Learning · Statistics 2026-01-15 Yuga Hikida, Ayush Bharti, Niall Jeffrey, François-Xavier Briol

On the use of graph models to achieve individual and group fairness

Machine Learning algorithms are ubiquitous in key decision-making contexts such as justice, healthcare and finance, which has spawned a great demand for fairness in these procedures. However, the theoretical properties of such models in…

Machine Learning · Statistics 2026-01-14 Arturo Pérez-Peralta, Sandra Benítez-Peña, Rosa E. Lillo

Robust low-rank estimation with multiple binary responses using pairwise AUC loss

Multiple binary responses arise in many modern data-analytic problems. Although fitting separate logistic regressions for each response is computationally attractive, it ignores shared structure and can be statistically inefficient,…

Machine Learning · Statistics 2026-01-14 The Tien Mai

Structural Dimension Reduction in Bayesian Networks

This work introduces a novel technique, named structural dimension reduction, to collapse a Bayesian network onto a minimum and localized one while ensuring that probabilistic inferences between the original and reduced networks remain…

Machine Learning · Statistics 2026-01-14 Pei Heng, Yi Sun, Jianhua Guo

Towards A Unified PAC-Bayesian Framework for Norm-based Generalization Bounds

Understanding the generalization behavior of deep neural networks remains a fundamental challenge in modern statistical learning theory. Among existing approaches, PAC-Bayesian norm-based bounds have demonstrated particular promise due to…

Machine Learning · Statistics 2026-01-14 Xinping Yi, Gaojie Jin, Xiaowei Huang, Shi Jin

A Statistical Assessment of Amortized Inference Under Signal-to-Noise Variation and Distribution Shift

Since the turn of the century, approximate Bayesian inference has steadily evolved as new computational techniques have been incorporated to handle increasingly complex and large-scale predictive problems. The recent success of deep neural…

Machine Learning · Statistics 2026-01-14 Roy Shivam Ram Shreshtth, Arnab Hazra, Gourab Mukherjee

Decentralized Online Convex Optimization with Unknown Feedback Delays

Decentralized online convex optimization (D-OCO), where multiple agents within a network collaboratively learn optimal decisions in real-time, arises naturally in applications such as federated learning, sensor networks, and multi-agent…

Machine Learning · Statistics 2026-01-14 Hao Qiu, Mengxiao Zhang, Juliette Achddou

Adversarial Disentanglement by Backpropagation with Physics-Informed Variational Autoencoder

Inference and prediction under partial knowledge of a physical system is challenging, particularly when multiple confounding sources influence the measured response. Explicitly accounting for these influences in physics-based models is…

Machine Learning · Statistics 2026-01-14 Ioannis Christoforos Koune, Alice Cicirello