机器学习 — Scifaro

mlr3torch: A Deep Learning Framework in R based on mlr3 and torch

Deep learning (DL) has become a cornerstone of modern machine learning (ML) praxis. We introduce the R package mlr3torch, which is an extensible DL framework for the mlr3 ecosystem. It is built upon the torch package, and simplifies the…

机器学习 · 统计学 2026-04-21 Sebastian Fischer , Lukas Burk , Carson Zhang , Bernd Bischl , Martin Binder

StrEBM: A Structured Latent Energy-Based Model for Blind Source Separation

This paper proposes StrEBM, a structured latent energy-based model for source-wise structured representation learning. The framework is motivated by a broader goal of promoting identifiable and decoupled latent organization by assigning…

机器学习 · 统计学 2026-04-21 Yuan-Hao Wei

PAC-Bayes Bounds for Gibbs Posteriors via Singular Learning Theory

We derive explicit non-asymptotic PAC-Bayes generalization bounds for Gibbs posteriors, that is, data-dependent distributions over model parameters obtained by exponentially tilting a prior with the empirical risk. Unlike classical…

机器学习 · 统计学 2026-04-21 Chenyang Wang , Yun Yang

Forecast Sports Outcomes under Efficient Market Hypothesis: Theoretical and Experimental Analysis of Odds-Only and Generalised Linear Models

Converting betting odds into accurate outcome probabilities is a fundamental challenge in order to use betting odds as a benchmark for sports forecasting and market efficiency analysis. In this study, we propose two methods to overcome the…

机器学习 · 统计学 2026-04-21 Kaito Goto , Naoya Takeishi , Takehisa Yairi

Neighbor Embedding for High-Dimensional Sparse Poisson Data

Across many scientific fields, measurements often represent the number of times an event occurs. For example, a document can be represented by word occurrence counts, neural activity by spike counts per time window, or online communication…

机器学习 · 统计学 2026-04-21 Noga Mudrik , Adam S. Charles

Extraction of informative statistical features in the problem of forecasting time series generated by It{\^{o}}-type processes

In this paper, we consider the problem of extraction of most informative features from time series that are regarded as observed values of stochastic processes satisfying the It{\^{o}} stochastic differential equations with unknown random…

机器学习 · 统计学 2026-04-21 Victor Korolev , Mikhail Ivanov , Tatiana Kukanova , Artyom Rukavitsa , Alexander Vakshin , Peter Solomonov , Alexander Zeifman

A Mechanism Study of Delayed Loss Spikes in Batch-Normalized Linear Models

Delayed loss spikes have been reported in neural-network training, but existing theory mainly explains earlier non-monotone behavior caused by overly large fixed learning rates. We study one stylized hypothesis: normalization can postpone…

机器学习 · 统计学 2026-04-21 Peifeng Gao , Wenyi Fang , Yang Zheng , Difan Zou

Fairness Constraints in High-Dimensional Generalized Linear Models

Machine learning models often inherit biases from historical data, raising critical concerns about fairness and accountability. Conventional fairness interventions typically require access to sensitive attributes like gender or race, but…

机器学习 · 统计学 2026-04-21 Yixiao Lin , James Booth

Unsupervised feature selection using Bayesian Tucker decomposition

In this paper, we proposed Bayesian Tucker decomposition (BTuD) in which residual is supposed to obey Gaussian distribution analogous to linear regression. Although we have proposed an algorithm to perform the proposed BTuD, the…

机器学习 · 统计学 2026-04-21 Y-h. Taguchi , Yoh-ichi Mototake

Differentially Private Conformal Prediction

Conformal prediction (CP) has attracted broad attention as a simple and flexible framework for uncertainty quantification through prediction sets. In this work, we study how to deploy CP under differential privacy (DP) in a statistically…

机器学习 · 统计学 2026-04-21 Jiamei Wu , Ce Zhang , Zhipeng Cai , Jingsen Kong , Bei Jiang , Linglong Kong , Lingchen Kong

Conformal Risk Control under Non-Monotone Losses: Theory and Finite-Sample Guarantees

Conformal risk control (CRC) provides distribution-free guarantees for controlling the expected loss at a user-specified level. Existing theory typically assumes that the loss decreases monotonically with a tuning parameter that governs the…

机器学习 · 统计学 2026-04-21 Tareq Aldirawi , Yun Li , Wenge Guo

A Sensitivity Approach to Causal Inference Under Limited Overlap

Limited overlap between treated and control groups is a key challenge in observational analysis. Standard approaches like trimming importance weights can reduce variance but introduce a fundamental bias. We propose a sensitivity framework…

机器学习 · 统计学 2026-04-21 Yuanzhe Ma , Yian Huang , Hongseok Namkoong

Efficient Inference for Coupled Hidden Markov Models in Continuous Time and Discrete Space

Systems of interacting continuous-time Markov chains are a powerful model class, but inference is typically intractable in high dimensional settings. Auxiliary information, such as noisy observations, is typically only available at discrete…

机器学习 · 统计学 2026-04-21 Giosue Migliorini , Padhraic Smyth

On the Theory of Continual Learning with Gradient Descent for Neural Networks

Continual learning, the ability of a model to adapt to an ongoing sequence of tasks without forgetting earlier ones, is a central goal of artificial intelligence. To better understand its underlying mechanisms, we study the limitations of…

机器学习 · 统计学 2026-04-21 Hossein Taheri , Avishek Ghosh , Arya Mazumdar

Policy Testing in Markov Decision Processes

We study the policy testing problem in discounted Markov decision processes (MDPs) in the fixed-confidence setting under a generative model with static sampling. The goal is to decide whether the value of a given policy exceeds a specified…

机器学习 · 统计学 2026-04-21 Kaito Ariu , Po-An Wang , Alexandre Proutiere , Kenshi Abe

Introducing the O-Value: A Universal Standardization for Confusion-Matrix-Based Classification Performance Metrics

Many classification performance metrics exist, each suited to a specific application. However, these metrics often differ in scale and can exhibit varying sensitivity to class imbalance rates in the test set. As a result, it is difficult to…

机器学习 · 统计学 2026-04-21 Ningsheng Zhao , Trang Bui , Jia Yuan Yu , Krzysztof Dzieciolowski

A Scalable Nystrom-Based Kernel Two-Sample Test with Permutations

Two-sample hypothesis testing-determining whether two sets of data are drawn from the same distribution-is a fundamental problem in statistics and machine learning with broad scientific applications. In the context of nonparametric testing,…

机器学习 · 统计学 2026-04-21 Antoine Chatalic , Marco Letizia , Nicolas Schreuder , Lorenzo Rosasco

Bayesian Neural Networks: An Introduction and Survey

Neural Networks (NNs) have provided state-of-the-art results for many challenging machine learning tasks such as detection, regression and classification across the domains of computer vision, speech recognition and natural language…

机器学习 · 统计学 2026-04-21 Ethan Goan , Clinton Fookes

Adaptive multi-fidelity optimization with fast learning rates

In multi-fidelity optimization, biased approximations of varying costs of the target function are available. This paper studies the problem of optimizing a locally smooth function with a limited budget, where the learner has to make a…

机器学习 · 统计学 2026-04-20 Come Fiegel , Victor Gabillon , Michal Valko

PRIM-cipal components analysis

Supervised No Free Lunch Theorems (NFLTs) are well studied, yet unsupervised NFLTs remain underexplored. For elliptical distributions, we prove that there exist two equally optimal, scientifically meaningful bump-hunting strategies that are…

机器学习 · 统计学 2026-04-20 Tianhao Liu , Daniel Andrés Díaz-Pachón , J. Sunil Rao