机器学习 — Scifaro

PRCD-MAP: Learning How Much to Trust Imperfect Priors in Causal Discovery

External priors of unknown reliability create a brittle trade-off in causal discovery: blind trust amplifies errors, blind rejection wastes signal. Real priors are also heterogeneously reliable -- physical laws are trustworthy,…

机器学习 · 统计学 2026-05-08 Xihang Shan , Da Zhou

A Novel Computational Framework for Causal Inference: Tree-Based Discretization with ILP-Based Matching

Causal inference is essential for data-driven decision-making, as it aims to uncover causal relationships from observational data. However, identifying causality remains challenging due to the potential for confounding and the distinction…

机器学习 · 统计学 2026-05-08 Tianyu Yang , Md. Noor-E-Alam

Post-Selection Distributional Model Evaluation

Formal model evaluation methods typically certify that a model satisfies a prescribed target key performance indicator (KPI) level. However, in many applications, the relevant target KPI level may not be known a priori, and the user may…

机器学习 · 统计学 2026-05-08 Amirmohammad Farzaneh , Osvaldo Simeone

Filtered Spectral Projection for Quantum Principal Component Analysis

Quantum principal component analysis (qPCA) is commonly formulated as the extraction of eigenvalues and eigenvectors of a covariance-encoded density operator. Yet in many qPCA settings the practical goal is simpler: projection onto the…

机器学习 · 统计学 2026-05-08 Sk Mujaffar Hossain , Satadeep Bhattacharjee

Does Sparse Connectivity Improve Generalization? Convolutional Networks Below the Edge of Stability

Gradient descent on overparameterized neural networks typically operates at the Edge of Stability (EoS), where the largest Hessian eigenvalue hovers around a step-size-dependent threshold. We study how sparse connectivity changes…

机器学习 · 统计学 2026-05-08 Tongtong Liang , Esha Singh , Rahul Parhi , Alexander Cloninger , Yu-Xiang Wang

A Basin-Selection Perspective on Grokking via Singular Learning Theory

Grokking, the abrupt transition from memorization to generalisation after extended training, suggests the presence of competing solution basins with distinct statistical properties. We study this phenomenon through the lens of Singular…

机器学习 · 统计学 2026-05-08 Ben Cullen , Sergio Estan-Ruiz , Riya Danait , Jiayi Li

Is Flow Matching Just Trajectory Replay for Sequential Data?

Flow matching (FM) is increasingly used in scientific domains for time series generation and forecasting, where data often arise from underlying dynamical systems. However, it is not well-understood whether it learns transferable dynamical…

机器学习 · 统计学 2026-05-08 Soon Hoe Lim , Shizheng Lin , Michael W. Mahoney , N. Benjamin Erichson

Flow-Based Conformal Predictive Distributions

Conformal prediction provides a distribution-free framework for uncertainty quantification via prediction sets with exact finite-sample coverage. In low dimensions these sets are easy to interpret, but in high-dimensional or structured…

机器学习 · 统计学 2026-05-08 Trevor Harris

Principled Federated Random Forests for Heterogeneous Data

Random Forests (RF) are among the most powerful and widely used predictive models for centralized tabular data, yet few methods exist to adapt them to the federated learning setting. Unlike most federated learning approaches, the…

机器学习 · 统计学 2026-05-08 Rémi Khellaf , Erwan Scornet , Aurélien Bellet , Julie Josse

Generative Modeling of Discrete Data Using Geometric Latent Subspaces

We propose a geometric latent-subspace framework for generative modeling of discrete data. Specifically, we introduce latent subspaces in the exponential parameter space of product manifolds of categorical distributions as a novel method…

机器学习 · 统计学 2026-05-08 Daniel Gonzalez-Alvarado , Jonas Cassel , Stefania Petra , Christoph Schnörr

Don't Throw Away Your Beams: Improving Consistency-based Uncertainties in LLMs via Beam Search

Consistency-based methods have emerged as an effective approach to uncertainty quantification (UQ) in large language models. These methods typically rely on several generations obtained via multinomial sampling, measuring their agreement…

机器学习 · 统计学 2026-05-08 Ekaterina Fadeeva , Maiya Goloburda , Aleksandr Rubashevskii , Roman Vashurin , Artem Shelmanov , Preslav Nakov , Mrinmaya Sachan , Maxim Panov

Optimal In-context Adaptivity and Distributional Robustness of Transformers

We study in-context learning problems where a Transformer is pretrained on tasks drawn from a mixture distribution $\pi=\sum_{\alpha\in\mathcal{A}} \lambda_{\alpha} \pi_{\alpha}$, called the pretraining prior, in which each mixture…

机器学习 · 统计学 2026-05-08 Tianyi Ma , Tengyao Wang , Richard J. Samworth

Generalization Below the Edge of Stability: The Role of Data Geometry

Understanding generalization in overparameterized neural networks hinges on the interplay between the data geometry, neural architecture, and training dynamics. In this paper, we theoretically explore how data geometry controls this…

机器学习 · 统计学 2026-05-08 Tongtong Liang , Alexander Cloninger , Rahul Parhi , Yu-Xiang Wang

Multivariate Standardized Residuals for Conformal Prediction

While split conformal prediction guarantees marginal coverage, approaching the stronger property of conditional coverage is essential for reliable uncertainty quantification. Naive conformal scores, however, suffer from poor conditional…

机器学习 · 统计学 2026-05-08 Sacha Braun , Eugène Berta , Michael I. Jordan , Francis Bach

Sharp Gaussian approximations for Decentralized Federated Learning

Federated Learning has gained traction in privacy-sensitive collaborative environments, with local SGD emerging as a key optimization method in decentralized settings. While its convergence properties are well-studied, asymptotic…

机器学习 · 统计学 2026-05-08 Soham Bonnerjee , Sayar Karmakar , Wei Biao Wu

Revenue Maximization Under Sequential Price Competition Via The Estimation Of s-Concave Demand Functions

We consider price competition among multiple sellers over a selling horizon of $T$ periods. In each period, sellers simultaneously offer their prices (which are made public) and subsequently observe their respective demand (not made…

机器学习 · 统计学 2026-05-08 Daniele Bracale , Moulinath Banerjee , Cong Shi , Yuekai Sun

CatNet: Controlling the False Discovery Rate in LSTM with SHAP Feature Importance and Gaussian Mirrors

We introduce CatNet, an algorithm that effectively controls False Discovery Rate (FDR) and selects significant features in LSTM. CatNet employs the derivative of SHAP values to quantify the feature importance, and constructs a vector-formed…

机器学习 · 统计学 2026-05-08 Jiaan Han , Junxiao Chen , Yanzhe Fu

Scaling and renormalization in high-dimensional regression

From benign overfitting in overparameterized models to rich power-law scalings in performance, simple ridge regression displays surprising behaviors sometimes thought to be limited to deep neural networks. This balance of phenomenological…

机器学习 · 统计学 2026-05-08 Alexander Atanasov , Jacob A. Zavatone-Veth , Cengiz Pehlevan

Sharp Capacity Thresholds in Linear Associative Memory: From Winner-Take-All to Listwise Retrieval

How many key-value associations can a $d\times d$ linear memory store? We show that the answer depends not only on the $d^2$ degrees of freedom in the memory matrix, but also on the retrieval criterion. In an isotropic Gaussian model for…

机器学习 · 统计学 2026-05-07 Nicholas Barnfield , Juno Kim , Eshaan Nichani , Jason D. Lee , Yue M. Lu

Proximal Projection for Doubly Sparse Regularized Models

Regularization is often used in high-dimensional regression settings to generate a sparse model, which can save tremendous computing resources and identify predictors that are most strongly associated with the response. When the predictors…

机器学习 · 统计学 2026-05-07 Jia Wei He , R. Ayesha Ali , Gerarda Darlington