Machine Learning - Scifaro

Characterizing and Identifying Separable Graphical Models

We study a broad class of graphical models whose independencies correspond to vertex separation in mixed graphs with directed, undirected, and bidirected edges, and that are capable of encoding independence structures arising from feedback,…

Machine Learning · Statistics 2026-07-01 Christopher Meek, Kayvan Sadeghi

Function-Counting Theory for Low-Dimensional Data Structures

The success of deep learning models in classification and regression is widely attributed to the low-dimensional structure that real-world data tend to exhibit, despite their high-dimensional representation. This work attempts to provide a…

Machine Learning · Statistics 2026-07-01 Konstantin Häberle, Helmut Bölcskei

Deep Multitask Learning for Mixed-Type Outcomes with Shared Sparsity

Most existing multitask learning approaches are limited by their reliance on task-specific loss functions tailored to the scale and type of each outcome. When outcomes differ across tasks, these losses are generally not directly comparable,…

Machine Learning · Statistics 2026-07-01 Huichao Li, Tong Wang, Sanguo Zhang, Shuangge Ma

Hierarchical Variational Kalman Filtering

Traditional variational Kalman filtering with unknown noise statistics suffers from inconsistent process covariance estimation and slow convergence speed, limiting its practical utility. To address these issues, we introduce a surrogate…

Machine Learning · Statistics 2026-07-01 Shilei Li, Dawei Shi, Wei Zheng, Ling Shi

Neural Network-Based Estimation of Time-Dependent Parameters in AR(p) Processes

We investigate a forecasting framework based on a simple discrete-time dynamic model with coefficients varying in time. The parameters of the model are recovered within a deep learning framework, which makes it possible to retain a…

Machine Learning · Statistics 2026-07-01 Agnieszka Kopeć, Paweł Przybyłowicz, Martyna Wiącek

From Spectral Methods to Sample Complexity Bounds for Fourier Neural Operators

We establish approximation and learning guarantees for Fourier neural operators (FNOs) applied to time-$T$ solution operators of dissipative evolution equations. The analysis builds on the premise that FNOs can efficiently approximate and…

Machine Learning · Statistics 2026-07-01 Nisha Chandramoorthy, Daniel Sanz-Alonso, Nathan Waniorek

Accelerating Conformal Prediction via Approximate Leave-One-Out

While conformal prediction provides a general framework for uncertainty quantification in predictive inference, its application is often limited by computational cost. Recent methods, including Jackknife+ and Jackknife-minmax, achieve…

Machine Learning · Statistics 2026-06-30 Jiachen Cong, Jingbo Liu

MNAR-$k$-means: A $k$-means Clustering for Data Missing Not at Random with Magnitude-Decaying Probability

The classical $k$-means clustering, based on distances computed from all data features, cannot be directly applied to incomplete data with missing values. A natural extension of $k$-means to missing data is to involve only the observed…

Machine Learning · Statistics 2026-06-30 Xin Guan

Dynamic Gaussian Processes and the Vanilla-SPDE Exchange

Gaussian process inference is often limited by cubic computational costs, a challenge that becomes more pronounced in spatio-temporal settings where posterior inference is required over dense grids. While state-space SPDE formulations…

Machine Learning · Statistics 2026-06-30 Rui-Yang Zhang, Lachlan Astfalck, Edward Cripps, David Leslie, Henry Moss

SGD at the Edge of Stability: Stochastic Stabilization with Large Learning Rates

Modern deep learning has been shown to operate at the edge of stability, routinely using learning rates far larger than those justified by classical optimization theory. Most prior analyses of the edge of stability phenomenon focus on…

Machine Learning · Statistics 2026-06-29 Konstantinos Emmanouilidis, Lachlan MacDonald, Salma Tarmoun, Rene Vidal

Dynamic Prediction of Alternating Recurrent Events via Neural Network

Alternating recurrent events -- event-times of a specific nature that trigger a secondary refractory period -- occur in a wide range of fields, including behavioral science, criminal justice, and biostatistics. Analysis of these events…

Machine Learning · Statistics 2026-06-29 Abigail Loe, Susan Murry, Zhenke Wu

Separation Capacity of Scattering Networks

In this paper, we attempt to enhance the theoretical understanding of convolutional neural networks (CNNs) as feature extractors in classification tasks by analyzing them through the lens of Cover's function-counting theory. Specifically,…

Machine Learning · Statistics 2026-06-29 Konstantin Häberle, Helmut Bölcskei

Optimization Dynamics Imprint Semantic Specificity in Contrastive Embedding Norms

Contrastive embedding models trained with scale-invariant losses are typically paired with distance metrics like cosine similarity, effectively ignoring embedding magnitudes. However, surprisingly, empirical studies reveal that despite…

Machine Learning · Statistics 2026-06-29 Ziwei Su, Junyu Ren, Victor Veitch

Doubly Robust Adaptive Conformal Inference for Causal Effects Under Temporal Dependence

We propose doubly robust adaptive conformal inference (DR-ACI), which constructs prediction intervals for doubly robust pseudo-outcomes under temporal dependence.

Machine Learning · Statistics 2026-06-29 Andreas Koukorinis, Ricardo Silva

Factorizable Normalizing Flows for parameter-dependent density morphing

Normalizing Flows excel at modeling a single fixed density, yet many problems across the sciences, such as high energy physics, instead require modeling how that density deforms as a function of continuous parameters: the strength of a…

Machine Learning · Statistics 2026-06-29 Davide Valsecchi, Mauro Donegà, Rainer Wallny

Non-parametric recovery of causal diffusion mechanisms from steady-state observations

We consider sparse multivariate stochastic systems that evolve in continuous time according to a causal mechanism and present methodology to recover the system's time-infinitesimal transition mechanism from mere cross-sectional data. This…

Machine Learning · Statistics 2026-06-29 Richard Schwank, Mathias Drton

SGD Provably Prioritizes a Shortcut Spurious Feature in the XOR Model

Neural networks are known to be susceptible to over-reliance on spurious correlations. However, the precise mechanism by which models exploit shortcut features is not fully understood, and algorithms to mitigate this behavior rely on as yet…

Machine Learning · Statistics 2026-06-29 Tyler LaBonte, Vidya Muthukumar

A Stochastic-Geometric Theory of Scaling Laws in Grokking

Delayed generalization (i.e., grokking) refers to the phenomenon in which a neural network fits its training data early in training but only begins to generalize after a prolonged delay, often through an abrupt transition. Despite extensive…

Machine Learning · Statistics 2026-06-29 Róisín Luo, Christian Gagné, Jonas Ngnawé, Ihsan Ullah, Karyn Morrissey

Extrapolating from Regularised Solutions for Solving Ill-Conditioned Linear Systems in Machine Learning

Rapid prototyping of algorithms is a critical step in modern machine learning. Most algorithms exploit linear algebra, creating a need for lightweight numerical routines which -- while potentially sub-optimal for the task at hand -- can be…

Machine Learning · Statistics 2026-06-29 Disha Hegde, Jon Cockayne, Chris. J. Oates

Highly Data Parallelizable Estimation of the Sliced-Wasserstein Distance Using Cumulative Distribution Functions

The Sliced Wasserstein (SW) distance has emerged as a computationally attractive alternative to the Wasserstein distance by leveraging one-dimensional optimal transport along random projections. Standard estimators of the SW distance rely…

Machine Learning · Statistics 2026-06-29 Christophe Vauthier, Quentin Mérigot, Anna Korba