Machine Learning - Scifaro

Density-Matrix Spectral Embeddings for Categorical Data: Operator Structure and Stability

We introduce a supervised dimensionality reduction methodology for categorical (and discretized mixed-type) data based on a density-matrix construction induced by class-conditional frequencies. Given a labeled dataset encoded in a one-hot…

Machine Learning · Statistics 2026-03-03 Raquel Bosch-Romeu, Antonio Falcó, José-Antonio Rodríguez-Gallego

LOCUS: A Distribution-Free Loss-Quantile Score for Risk-Aware Predictions

Modern machine learning models can be accurate on average yet still make mistakes that dominate deployment cost. We introduce Locus, a distribution-free wrapper that produces a per-input loss-scale reliability score for a fixed prediction…

Machine Learning · Statistics 2026-03-03 Matheus Barreto, Mário de Castro, Thiago R. Ramos, Denis Valle, Rafael Izbicki

Co-optimization for Adaptive Conformal Prediction

Conformal prediction (CP) provides finite-sample, distribution-free marginal coverage, but standard conformal regression intervals can be inefficient under heteroscedasticity and skewness. In particular, popular constructions such as…

Machine Learning · Statistics 2026-03-03 Xiaoyi Su, Zhixin Zhou, Rui Luo

Causal Effects with Unobserved Unit Types in Interacting Human-AI Systems

We study experiments on interacting populations of humans and AI agents, where both unit types and the interaction network remain unobserved. Although causal effects propagate throughout the system, the goal is to estimate effects on…

Machine Learning · Statistics 2026-03-03 William Overman, Sadegh Shirani, Mohsen Bayati

Adaptive Estimation and Inference in Conditional Moment Models via the Discrepancy Principle

We study adaptive estimation and inference in ill-posed linear inverse problems defined by conditional moment restrictions. Existing regularized estimators such as Regularized DeepIV (RDIV) require prior knowledge of the smoothness of the…

Machine Learning · Statistics 2026-03-03 Jiyuan Tan, Vasilis Syrgkanis

Random Features for Operator-Valued Kernels: Bridging Kernel Methods and Neural Operators

In this work, we investigate the generalization properties of random feature methods. Our analysis extends prior results for Tikhonov regularization to a broad class of spectral regularization techniques and further generalizes the setting…

Machine Learning · Statistics 2026-03-03 Mike Nguyen, Nicole Mücke

Learning with the Nash-Sutcliffe loss

The Nash-Sutcliffe efficiency ($\text{NSE}$) is a widely used, positively oriented relative measure for evaluating forecasts across multiple time series. However, it lacks a decision-theoretic foundation for this purpose. To address this,…

Machine Learning · Statistics 2026-03-03 Hristos Tyralis, Georgia Papacharalampous

Time-Aware Latent Space Bayesian Optimization

Latent-space Bayesian optimization (LSBO) extends Bayesian optimization to structured domains, such as molecular design, by searching in the continuous latent space of a generative model. However, most LSBO methods assume a fixed objective,…

Machine Learning · Statistics 2026-03-03 Tuan A. Vu, Julien Martinelli, Harri Lähdesmäki

Multivariate Spatio-Temporal Neural Hawkes Processes

We propose a Multivariate Spatio-Temporal Neural Hawkes Process for modeling complex multivariate event data with spatio-temporal dynamics. The proposed model extends continuous-time neural Hawkes processes by integrating spatial…

Machine Learning · Statistics 2026-03-03 Christopher Chukwuemeka, Hojun You, Mikyoung Jun

Smoothness Adaptivity in Constant-Depth Neural Networks: Optimal Rates via Smooth Activations

Smooth activation functions are ubiquitous in modern deep learning, yet their theoretical advantages over non-smooth counterparts remain poorly understood. In this work, we study both approximation and statistical properties of neural…

Machine Learning · Statistics 2026-03-03 Yuhao Liu, Zilin Wang, Lei Wu, Shaobo Zhang

Random Forests as Statistical Procedures: Design, Variance, and Dependence

We develop a finite-sample, design-based theory for random forests in which each tree is a randomized conditional predictor acting on fixed covariates and the forest is their Monte Carlo average. An exact variance identity separates Monte…

Machine Learning · Statistics 2026-03-03 Nathaniel S. O'Connell

Relaxed Triangle Inequality for Kullback-Leibler Divergence Between Multivariate Gaussian Distributions

The Kullback-Leibler (KL) divergence is not a proper distance metric and does not satisfy the triangle inequality, posing theoretical challenges in certain practical applications. Existing work has demonstrated that KL divergence between…

Machine Learning · Statistics 2026-03-03 Shiji Xiao, Yufeng Zhang, Chubo Liu, Yan Ding, Keqin Li, Kenli Li

DoFlow: Flow-based Generative Models for Interventional and Counterfactual Forecasting on Time Series

Time-series forecasting increasingly demands not only accurate observational predictions but also causal forecasting under interventional and counterfactual queries in multivariate systems. We present DoFlow, a flow-based generative model…

Machine Learning · Statistics 2026-03-03 Dongze Wu, Feng Qiu, Yao Xie

Optimal Stopping in Latent Diffusion Models

We identify and analyze a surprising phenomenon of Latent Diffusion Models (LDMs) where the final steps of the diffusion can degrade sample quality. In contrast to conventional arguments that justify early stopping for numerical stability,…

Machine Learning · Statistics 2026-03-03 Yu-Han Wu, Quentin Berthet, Gérard Biau, Claire Boyer, Romuald Elie, Pierre Marion

Fourier Analysis on the Boolean Hypercube via Hoeffding Functional Decomposition

Fourier analysis on the Boolean hypercube is fundamentally defined as the orthogonal decomposition of the space of pseudo-Boolean functions with respect to the uniform probability measure. In this work, we propose an ANOVA-based…

Machine Learning · Statistics 2026-03-03 Baptiste Ferrere, Nicolas Bousquet, Fabrice Gamboa, Jean-Michel Loubes, Joseph Muré

A universal compression theory for lottery ticket hypothesis and neural scaling laws

When training large-scale models, the performance typically scales with the number of parameters and the dataset size according to a slow power law. A fundamental theoretical and practical question is whether comparable performance can be…

Machine Learning · Statistics 2026-03-03 Hong-Yi Wang, Di Luo, Tomaso Poggio, Isaac L. Chuang, Liu Ziyin

Estimating Dimensionality of Neural Representations from Finite Samples

The global dimensionality of a neural representation manifold provides rich insight into the computational process underlying both artificial and biological neural networks. However, all existing measures of global dimensionality are…

Machine Learning · Statistics 2026-03-03 Chanwoo Chun, Abdulkadir Canatar, SueYeon Chung, Daniel Lee

A Projection-Based ARIMA Framework for Nonlinear Dynamics in Macroeconomic and Financial Time Series: Closed-Form Estimation and Rolling-Window Inference

We introduce Galerkin-ARIMA and Galerkin-SARIMA, a projection-based extension of classical ARIMA/SARIMA that replaces rigid linear lag operators with low-dimensional Galerkin basis expansions while preserving the familiar AR-MA…

Machine Learning · Statistics 2026-03-03 Haojie Liu, Zihan Lin

Low-Rank Thinning

The goal in thinning is to summarize a dataset using a small set of representative points. Remarkably, sub-Gaussian thinning algorithms like Kernel Halving and Compress can match the quality of uniform subsampling while substantially…

Machine Learning · Statistics 2026-03-03 Annabelle Michael Carrell, Albert Gong, Abhishek Shetty, Raaz Dwivedi, Lester Mackey

On weight and variance uncertainty in neural networks for regression tasks

We investigate the problem of weight uncertainty originally proposed by [Blundell et al. (2015). Weight uncertainty in neural networks. In International conference on machine learning, 1613-1622, PMLR.] in the context of neural networks…

Machine Learning · Statistics 2026-03-03 Moein Monemi, Morteza Amini, S. Mahmoud Taheri, Mohammad Arashi