Machine Learning - Scifaro

Impact of Positional Encoding: Clean and Adversarial Rademacher Complexity for Transformers under In-Context Regression

Positional encoding (PE) is a core architectural component of Transformers, yet its impact on the Transformer's generalization and robustness remains unclear. In this work, we provide the first generalization analysis for a single-layer…

Machine Learning · Statistics 2026-03-25 Weiyi He, Yue Xing

Riesz Regression As Direct Density Ratio Estimation

This study clarifies the relationship between Riesz regression [Chernozhukov et al., 2021] and density ratio estimation (DRE) in causal inference problems, such as average treatment effect estimation. We first show that the Riesz…

Machine Learning · Statistics 2026-03-25 Masahiro Kato

Geopolitics, Geoeconomics, and Sovereign Risk: Different Shocks, Different Channels

Geopolitical and geoeconomic shocks reprice sovereign credit risk through different transmission channels. Using a daily panel of 42 advanced and emerging economies over 2018–2025, we show that geopolitical shocks raise sovereign CDS…

Machine Learning · Statistics 2026-03-25 Alvaro Ortiz, Tomasa Rodrigo, Pablo Saborido

Graph Distribution-valued Signals: A Wasserstein Space Perspective

We introduce a novel framework for graph signal processing (GSP) that models signals as graph distribution-valued signals (GDSs), which are probability distributions in the Wasserstein space. This approach overcomes key limitations of…

Machine Learning · Statistics 2026-03-25 Yanan Zhao, Feng Ji, Xingchao Jian, Wee Peng Tay

Prediction-Powered Inference with Inverse Probability Weighting

Prediction-powered inference (PPI) is a recent framework for valid statistical inference with partially labeled data, combining model-based predictions on a large unlabeled set with bias correction from a smaller labeled subset. Building on…

Machine Learning · Statistics 2026-03-25 Jyotishka Datta, Nicholas G. Polson

Clusterpath Gaussian Graphical Modeling

Graphical models serve as effective tools for visualizing conditional dependencies between variables. However, as the number of variables grows, interpretation becomes increasingly difficult, and estimation uncertainty increases due to the…

Machine Learning · Statistics 2026-03-25 D. J. W. Touw, A. Alfons, P. J. F. Groenen, I. Wilms

Inference of Multiscale Gaussian Graphical Model

Gaussian Graphical Models (GGMs) are widely used in high-dimensional data analysis to synthesize the interaction between variables. In many applications, such as genomics or image analysis, graphical models rely on sparsity and clustering…

Machine Learning · Statistics 2026-03-25 Do Edmond Sanou, Christophe Ambroise, Geneviève Robin

Deep Learning Estimation of Absorbed Dose for Nuclear Medicine Diagnostics

The distribution of absorbed dose in radionuclide therapy with $^{177}$Lu can be approximated by convolving an image of the time-integrated activity distribution with a dose voxel kernel representing different tissue types. This fast but…

Machine Learning · Statistics 2026-03-25 Luciano Melodia

MAGPI: Multifidelity-Augmented Gaussian Process Inputs for Surrogate Modeling from Scarce Data

Supervised machine learning describes the practice of fitting a parameterized model to labeled input-output data. Supervised machine learning methods have demonstrated promise in learning efficient surrogate models that can (partially)…

Machine Learning · Statistics 2026-03-24 Atticus Rex, Elizabeth Qian, David Peterson

Structural Concentration in Weighted Networks: A Class of Topology-Aware Indices

This paper develops a unified framework for measuring concentration in weighted systems embedded in networks of interactions. While traditional indices such as the Herfindahl-Hirschman Index capture dispersion in weights, they neglect the…

Machine Learning · Statistics 2026-03-24 L. Riso, M. G. Zoia

CoNBONet: Conformalized Neuroscience-inspired Bayesian Operator Network for Reliability Analysis

Time-dependent reliability analysis of nonlinear dynamical systems under stochastic excitations is a critical yet computationally demanding task. Conventional approaches, such as Monte Carlo simulation, necessitate repeated evaluations of…

Machine Learning · Statistics 2026-03-24 Shailesh Garg, Souvik Chakraborty

Generalized Discrete Diffusion from Snapshots

We introduce Generalized Discrete Diffusion from Snapshots (GDDS), a unified framework for discrete diffusion modeling that supports arbitrary noising processes over large discrete state spaces. Our formulation encompasses all existing…

Machine Learning · Statistics 2026-03-24 Oussama Zekri, Théo Uscidda, Nicolas Boullé, Anna Korba

Accelerate Vector Diffusion Maps by Landmarks

We propose a landmark-constrained algorithm, LA-VDM (Landmark Accelerated Vector Diffusion Maps), to accelerate the Vector Diffusion Maps (VDM) framework built upon the Graph Connection Laplacian (GCL), which captures pairwise connection…

Machine Learning · Statistics 2026-03-24 Sing-Yuan Yeh, Yi-An Wu, Hau-Tieng Wu, Mao-Pei Tsui

Domain Elastic Transform: Bayesian Function Registration for High-Dimensional Scientific Data

Nonrigid registration is conventionally divided into point set registration, which aligns sparse geometries, and image registration, which aligns continuous intensity fields on regular grids. However, this dichotomy creates a critical…

Machine Learning · Statistics 2026-03-24 Osamu Hirose, Emanuele Rodola

Time-adaptive functional Gaussian Process regression

This paper proposes a new formulation of functional Gaussian Process regression in manifolds, based on an Empirical Bayes approach, in the spatiotemporal random field context. We apply the machinery of tight Gaussian measures in separable…

Machine Learning · Statistics 2026-03-24 MD Ruiz-Medina, AE Madrid, A Torres-Signes, JM Angulo

Stochastic approximation in non-markovian environments revisited

Based on some recent work of the author on stochastic approximation in non-markovian environments, the situation when the driving random process is non-ergodic in addition to being non-markovian is considered. Using this, we propose an…

Machine Learning · Statistics 2026-03-24 Vivek Shripad Borkar

Gradient Descent with Projection Finds Over-Parameterized Neural Networks for Learning Low-Degree Polynomials with Nearly Minimax Optimal Rate

We study the problem of learning a low-degree spherical polynomial of degree $k_0 = \Theta(1) \ge 1$ defined on the unit sphere in $\mathbb{R}^d$ by training an over-parameterized two-layer neural network with augmented feature in this paper. Our…

Machine Learning · Statistics 2026-03-24 Yingzhen Yang, Ping Li

Hard labels sampled from sparse targets mislead rotation invariant algorithms

One of the most common machine learning setups is logistic regression. In many classification models, including neural networks, the final prediction is obtained by applying a logistic link function to a linear score. In binary logistic…

Machine Learning · Statistics 2026-03-24 Avrajit Ghosh, Bin Yu, Manfred Warmuth, Peter Bartlett

Stability of Sequential and Parallel Coordinate Ascent Variational Inference

We highlight a striking difference in behavior between two widely used variants of coordinate ascent variational inference: the sequential and parallel algorithms. While such differences were known in the numerical analysis literature in…

Machine Learning · Statistics 2026-03-24 Debdeep Pati

Active Inference for Physical AI Agents -- An Engineering Perspective

Physical AI agents, such as robots and other embodied systems operating under tight and fluctuating resource constraints, remain far less capable than biological agents in open-ended real-world environments. This paper argues that Active…

Machine Learning · Statistics 2026-03-24 Bert de Vries