Machine Learning - Scifaro

Neural Networks on Symmetric Spaces of Noncompact Type

Recent works have demonstrated promising performances of neural networks on hyperbolic spaces and symmetric positive definite (SPD) manifolds. These spaces belong to a family of Riemannian manifolds referred to as symmetric spaces of…

Machine Learning · Statistics 2026-01-06 Xuan Son Nguyen, Shuo Yang, Aymeric Histace

Fibonacci-Driven Recursive Ensembles: Algorithms, Convergence, and Learning Dynamics

This paper develops the algorithmic and dynamical foundations of recursive ensemble learning driven by Fibonacci-type update flows. In contrast with classical boosting (Freund and Schapire, 1997; Friedman, 2001), where the ensemble evolves…

Machine Learning · Statistics 2026-01-06 Ernest Fokoué

Beyond Demand Estimation: Consumer Surplus Evaluation via Cumulative Propensity Weights

This paper develops a practical framework for using observational data to audit the consumer surplus effects of AI-driven decisions, specifically in targeted pricing and algorithmic lending. Traditional approaches first estimate demand…

Machine Learning · Statistics 2026-01-06 Zeyu Bian, Max Biggs, Ruijiang Gao, Zhengling Qi

Fast and Robust: Computationally Efficient Covariance Estimation for Sub-Weibull Vectors

High-dimensional covariance estimation is notoriously sensitive to outliers. While statistically optimal estimators exist for general heavy-tailed distributions, they often rely on computationally expensive techniques like semidefinite…

Machine Learning · Statistics 2026-01-06 Even He

Sharp Structure-Agnostic Lower Bounds for General Linear Functional Estimation

We establish a general statistical optimality theory for estimation problems where the target parameter is a linear functional of an unknown nuisance component that must be estimated from data. This formulation covers many causal and…

Machine Learning · Statistics 2026-01-06 Jikai Jin, Vasilis Syrgkanis

Comparison of neural network training strategies for the simulation of dynamical systems

Neural networks have become a widely adopted tool for modeling nonlinear dynamical systems from data. However, the choice of training strategy remains a key design decision, particularly for simulation tasks. This paper compares two…

Machine Learning · Statistics 2026-01-06 Paul Strasser, Andreas Pfeffer, Jakob Weber, Markus Gurtner, Andreas Körner

Comparison of generalised additive models and neural networks in applications: A systematic review

Neural networks have become a popular tool in predictive modelling, more commonly associated with machine learning and artificial intelligence than with statistics. Generalised Additive Models (GAMs) are flexible non-linear statistical…

Machine Learning · Statistics 2026-01-06 Jessica Doohan, Lucas Kook, Kevin Burke

Low-degree lower bounds via almost orthonormal bases

Low-degree polynomials have emerged as a powerful paradigm for providing evidence of statistical-computational gaps across a variety of high-dimensional statistical models [Wein25]. For detection problems -- where the goal is to test a…

Machine Learning · Statistics 2026-01-06 Alexandra Carpentier, Simone Maria Giancola, Christophe Giraud, Nicolas Verzelen

GRAND: Graph Release with Assured Node Differential Privacy

Differential privacy is a well-established framework for safeguarding sensitive information in data. While extensively applied across various domains, its application to network data -- particularly at the node level -- remains…

Machine Learning · Statistics 2026-01-06 Suqing Liu, Xuan Bi, Tianxi Li

Revisiting Randomization in Greedy Model Search

Feature subsampling is a core component of random forests and other ensemble methods. While recent theory suggests that this randomization acts solely as a variance reduction mechanism analogous to ridge regularization, these results…

Machine Learning · Statistics 2026-01-06 Xin Chen, Jason M. Klusowski, Yan Shuo Tan, Chang Yu

Gibbs randomness-compression proposition: An efficient deep learning

A proposition that connects randomness and compression is put forward via Gibbs entropy over a set of measurement vectors associated with a compression process. The proposition states that a lossy compression process is equivalent to…

Machine Learning · Statistics 2026-01-06 M. Süzen

A Linear Approach to Data Poisoning

Backdoor and data-poisoning attacks can flip predictions with tiny training corruptions, yet a sharp theory linking poisoning strength, overparameterization, and regularization is lacking. We analyze ridge least squares with an unpenalized…

Machine Learning · Statistics 2026-01-06 Donald Flynn, Diego Granziol

Consistency for Large Neural Networks: Regression and Classification

Although overparameterized models have achieved remarkable practical success, their theoretical properties, particularly their generalization behavior, remain incompletely understood. The well-known double descent phenomenon suggests that…

Machine Learning · Statistics 2026-01-06 Haoran Zhan, Yingcun Xia

Any-Time Regret-Guaranteed Algorithm for Control of Linear Quadratic Systems

We propose a computationally efficient algorithm that achieves anytime regret of order $\mathcal{O}(\sqrt{t})$, with explicit dependence on the system dimensions and on the solution of the Discrete Algebraic Riccati Equation (DARE). Our…

Machine Learning · Statistics 2026-01-06 Jafar Abbaszadeh Chekan, Cedric Langbort

Matrix Manifold Neural Networks++

Deep neural networks (DNNs) on Riemannian manifolds have garnered increasing interest in various applied areas. For instance, DNNs on spherical and hyperbolic manifolds have been designed to solve a wide range of computer vision and nature…

Machine Learning · Statistics 2026-01-06 Xuan Son Nguyen, Shuo Yang, Aymeric Histace

Training More Robust Classification Model via Discriminative Loss and Gaussian Noise Injection

Robustness of deep neural networks to input noise remains a critical challenge, as naive noise injection often degrades accuracy on clean (uncorrupted) data. We propose a novel training framework that addresses this trade-off through two…

Machine Learning · Statistics 2026-01-06 Hai-Vy Nguyen, Fabrice Gamboa, Sixin Zhang, Reda Chhaibi, Serge Gratton, Thierry Giaccone

Convergence of a L2 regularized Policy Gradient Algorithm for the Multi Armed Bandit

Although Multi Armed Bandit (MAB) on one hand and the policy gradient approach on the other hand are among the most used frameworks of Reinforcement Learning, the theoretical properties of the policy gradient algorithm used for MAB have not…

Machine Learning · Statistics 2026-01-06 Stefana Anita, Gabriel Turinici

MFAI: A Scalable Bayesian Matrix Factorization Approach to Leveraging Auxiliary Information

In various practical situations, matrix factorization methods suffer from poor data quality, such as high data sparsity and low signal-to-noise ratio (SNR). Here, we consider a matrix factorization problem by utilizing auxiliary…

Machine Learning · Statistics 2026-01-06 Zhiwei Wang, Fa Zhang, Cong Zheng, Xianghong Hu, Mingxuan Cai, Can Yang

Generative Conditional Missing Imputation Networks

In this study, we introduce a sophisticated generative conditional strategy designed to impute missing values within datasets, an area of considerable importance in statistical analysis. Specifically, we initially elucidate the theoretical…

Machine Learning · Statistics 2026-01-05 George Sun, Yi-Hui Zhou

Detecting Unobserved Confounders: A Kernelized Regression Approach

Detecting unobserved confounders is crucial for reliable causal inference in observational studies. Existing methods require either linearity assumptions or multiple heterogeneous environments, limiting applicability to nonlinear…

Machine Learning · Statistics 2026-01-05 Yikai Chen, Yunxin Mao, Chunyuan Zheng, Hao Zou, Shanzhi Gu, Shixuan Liu, Yang Shi, Wenjing Yang, Kun Kuang, Haotian Wang