机器学习 — Scifaro

Shallow Neural Networks Learn Low-Degree Spherical Polynomials with Feature Learning by Learnable Channel Attention

We study the problem of learning a low-degree spherical polynomial of degree $\ell_0 = \Theta(1) \ge 1$ defined on the unit sphere in $\RR^d$ by training an over-parameterized two-layer neural network (NN) with channel attention in this…

机器学习 · 统计学 2026-04-28 Yingzhen Yang

Flexible Deep Neural Networks for Partially Linear Survival Data: Estimation and Survival Inference

We propose a flexible deep neural network (DNN) framework for modeling survival data within a partially linear regression structure. The approach preserves interpretability through a parametric linear component for covariates of primary…

机器学习 · 统计学 2026-04-28 Asaf Ben Arie , Malka Gorfine

Branching Flows: Discrete, Continuous, and Manifold Flow Matching with Splits and Deletions

Diffusion and flow matching approaches to generative modeling have shown promise in domains where the state space is continuous, such as image generation or protein folding & design, and discrete, exemplified by diffusion large language…

机器学习 · 统计学 2026-04-28 Lukas Billera , Hedwig Nora Nordlinder , Jack Collier Ryder , Anton Oresten , Aron Stålmarck , Theodor Mosetti Björk , Ben Murrell

Modeling Parkinson's Disease Progression Using Longitudinal Voice Biomarkers: A Comparative Study of Statistical and Neural Mixed-Effects Models

Longitudinal voice biomarkers provide a non-invasive source of information for monitoring Parkinson's disease progression, but their statistical analysis is difficult because repeated measurements from the same subject are correlated,…

机器学习 · 统计学 2026-04-28 Ran Tong , Lanruo Wang , Tong Wang , Wei Yan

Beyond ReLU: How Activations Affect Neural Kernels and Random Wide Networks

In recent years, the neural tangent kernel (NTK) and neural network Gaussian process kernel (NNGP) have given theoreticians tractable limiting cases of fully connected neural networks. However, the property of these kernels are poorly…

机器学习 · 统计学 2026-04-28 David Holzmüller , Max Schölpple

High-Dimensional Private Linear Regression with Optimal Rates

Differentially private (DP) linear regression has received significant attention in the recent theoretical literature, with several approaches proposed to improve error rates. Our work considers the popular high-dimensional regime with…

机器学习 · 统计学 2026-04-28 Simone Bombari , Jialei Luo , Inbar Seroussi , Marco Mondelli

Learning Operators by Regularized Stochastic Gradient Descent with Operator-valued Kernels

We consider a class of statistical inverse problems involving the estimation of a regression operator from a Polish space to a separable Hilbert space, where the target lies in a vector-valued reproducing kernel Hilbert space induced by an…

机器学习 · 统计学 2026-04-28 Jia-Qi Yang , Lei Shi

Statistical Test for Diffusion-Based Anomaly Localization via Selective Inference

Anomaly localization in images -- identifying regions that deviate from normal patterns -- is vital in applications such as medical diagnosis and industrial inspection. A recent trend is the use of image generation models in anomaly…

机器学习 · 统计学 2026-04-28 Teruyuki Katsuoka , Tomohiro Shiraishi , Daiki Miwa , Vo Nguyen Le Duy , Ichiro Takeuchi

Sub-linear Regret Bounds for Bayesian Optimisation in Unknown Search Spaces

Bayesian optimisation is a popular method for efficient optimisation of expensive black-box functions. Traditionally, BO assumes that the search space is known. However, in many problems, this assumption does not hold. To this end, we…

机器学习 · 统计学 2026-04-28 Hung Tran-The , Sunil Gupta , Santu Rana , Huong Ha , Svetha Venkatesh

Trading Convergence Rate with Computational Budget in High Dimensional Bayesian Optimization

Scaling Bayesian optimisation (BO) to high-dimensional search spaces is a active and open research problems particularly when no assumptions are made on function structure. The main reason is that at each iteration, BO requires to find…

机器学习 · 统计学 2026-04-28 Hung Tran-The , Sunil Gupta , Santu Rana , Svetha Venkatesh

CLVAE: A Variational Autoencoder for Long-Term Customer Revenue Forecasting

Predicting customers' long-term revenue from sparse and irregular transaction data is central to marketing resource allocation in non-contractual settings, yet existing approaches face a trade-off. Traditional probabilistic customer base…

机器学习 · 统计学 2026-04-27 Jeffrey Näf , Riana Valera Mbelson , Markus Meierer

Mixed Membership sub-Gaussian Models

The Gaussian mixture model is widely used in unsupervised learning, owing to its simplicity and interpretability. However, a fundamental limitation of the classical Gaussian mixture model is that it forces each observation to belong to…

机器学习 · 统计学 2026-04-27 Huan Qing

Explanation of Dynamic Physical Field Predictions using WassersteinGrad: Application to Autoregressive Weather Forecasting

As the demand to integrate Artificial Intelligence into high-stakes environments continues to grow, explaining the reasoning behind neural-network predictions has shifted from a theoretical curiosity to a strict operational requirement. Our…

机器学习 · 统计学 2026-04-27 Younes Essafouri , Laure Raynaud , Luciano Drozda , Laurent Risser

FedSPDnet: Geometry-Aware Federated Deep Learning with SPDnet

We introduce two federated learning frameworks for the classical SPDnet model operating on symmetric positive definite (SPD) matrices with Stiefel-constrained parameters. Unlike standard Euclidean averaging, which violates orthogonality,…

机器学习 · 统计学 2026-04-27 Thibault Pautrel , Florent Bouchard , Ammar Mian , Guillaume Ginolhac

Conformalized Super Learner

The Super Learner (SL) is a widely used ensemble method that combines predictions from a library of learners based on their predictive performance. Interval predictions are of considerable practical interest because they allow uncertainty…

机器学习 · 统计学 2026-04-27 Zhanli Wu , Fabrizio Leisen , Miguel-Angel Luque-Fernandez , F. Javier Rubio

Pack only the essentials: Adaptive dictionary learning for kernel ridge regression

One of the major limits of kernel ridge regression (KRR) is that storing and manipulating the kernel matrix K_n for n samples requires O(n^2) space, which rapidly becomes unfeasible for large n. Nystrom approximations reduce the space…

机器学习 · 统计学 2026-04-27 Daniele Calandriello , Alessandro Lazaric , Michal Valko

Pliable rejection sampling

Rejection sampling is a technique for sampling from difficult distributions. However, its use is limited due to a high rejection rate. Common adaptive rejection sampling methods either work only for very specific distributions or without…

机器学习 · 统计学 2026-04-27 Akram Erraqabi , Michal Valko , Alexandra Carpentier , Odalric-Ambrym Maillard

Sparse Network Inference under Imperfect Detection and its Application to Ecological Networks

Recovering latent structure from count data has received considerable attention in network inference, particularly when one seeks both cross-group interactions and within-group similarity patterns in bipartite networks, which is widely used…

机器学习 · 统计学 2026-04-27 Aoran Zhang , Tianyao Wei , Maria J. Guerrero , César A. Uribe

Distributional Off-Policy Evaluation with Deep Quantile Process Regression

This paper investigates the off-policy evaluation (OPE) problem from a distributional perspective. Rather than focusing solely on the expectation of the total return, as in most existing OPE methods, we aim to estimate the entire return…

机器学习 · 统计学 2026-04-27 Qi Kuang , Chao Wang , Yuling Jiao , Fan Zhou

FLUID: Flow-based Unified Inference for Dynamics

Bayesian filtering and smoothing for high-dimensional nonlinear dynamical systems are fundamental yet challenging problems in many areas of science and engineering. In this work, we propose FLUID, a flow-based unified amortized inference…

机器学习 · 统计学 2026-04-27 Tiangang Cui , Xiaodong Feng , Chenlong Pei , Xiaoliang Wan , Tao Zhou