机器学习 — Scifaro

CROCS: A Two-Stage Clustering Framework for Behaviour-Centric Consumer Segmentation with Smart Meter Data

With grid operators confronting rising uncertainty from renewable integration and a broader push toward electrification, Demand-Side Management (DSM) -- particularly Demand Response (DR) -- has attracted significant attention as a…

机器学习 · 统计学 2026-05-26 Luke W. Yerbury , Ricardo J. G. B. Campello , G. C. Livingston , Mark Goldsworthy , Lachlan O'Neil

Implicit geometric regularization in flow matching via density weighted Stein operators

Flow Matching (FM) has emerged as a powerful paradigm for continuous normalizing flows, yet standard FM implicitly performs an unweighted $L^2$ regression over the entire ambient space. In high dimensions, this leads to a fundamental…

机器学习 · 统计学 2026-05-26 Shinto Eguchi

Robust inference using density-powered Stein operators

We introduce a density-power weighted variant for the Stein operator, called the $\gamma$-Stein operator. This is a novel class of operators derived from the $\gamma$-divergence, designed to build robust inference methods for unnormalized…

机器学习 · 统计学 2026-05-26 Shinto Eguchi

A Spectral Framework for Graph Neural Operators: Convergence Guarantees and Tradeoffs

Graphons, as limits of graph sequences, provide an operator-theoretic framework for analyzing the asymptotic behavior of graph neural operators. Spectral convergence of sampled graphs to graphons induces convergence of the corresponding…

机器学习 · 统计学 2026-05-26 Roxanne Holden , Luana Ruiz

One-shot Conditional Sampling: MMD meets Nearest Neighbors

How can we generate samples from a conditional distribution that we never fully observe? This question arises across a broad range of applications in both modern machine learning and classical statistics, including image post-processing in…

机器学习 · 统计学 2026-05-26 Anirban Chatterjee , Sayantan Choudhury , Rohan Hore

Some Robustness Properties of Label Cleaning

We demonstrate that learning procedures that rely on aggregated labels, e.g., label information distilled from noisy responses, enjoy robustness properties impossible without data cleaning. This robustness appears in several ways. In the…

机器学习 · 统计学 2026-05-26 Chen Cheng , John Duchi

Neural Stochastic Differential Equations on Compact State Spaces: Theory, Methods, and Application to Suicide Risk Modeling

Ecological Momentary Assessment (EMA) studies enable the collection of high-frequency self-reports of suicidal thoughts and behaviors (STBs) via smartphones. Latent stochastic differential equations (SDEs) are a promising model class for…

机器学习 · 统计学 2026-05-26 Malinda Lu , Yue-Jane Liu , Matthew K. Nock , Yaniv Yacoby

Hybrid least squares for learning functions from highly noisy data

Motivated by the need for efficient estimation of conditional expectations, we consider a least-squares function approximation problem with heavily polluted data. Existing methods that are effective in the small-noise regime are suboptimal…

机器学习 · 统计学 2026-05-26 Ben Adcock , Bernhard Hientzsch , Akil Narayan , Yiming Xu

A Statistical Framework for Model Selection in LSTM Networks

Long Short-Term Memory (LSTM) neural network models have become the cornerstone for sequential data modeling in numerous applications, ranging from natural language processing to time series forecasting. Despite their success, the problem…

机器学习 · 统计学 2026-05-26 Fahad Mostafa

Re-examining Granger Causality with Causal Bayesian Networks and Reichenbachs Principles

Characterising cause-effect relationships in complex systems is fundamental to understanding their underlying mechanisms. Granger causality (GC) remains a widely used computational tool for identifying causal relationships in time series…

机器学习 · 统计学 2026-05-26 S. A. Adedayo

Applications of Trajectory Data from the Perspective of a Road Transportation Agency: Literature Review and Maryland Case Study

Transportation agencies have an opportunity to leverage increasingly-available trajectory datasets to improve their analyses and decision-making processes. However, this data is typically purchased from vendors, which means agencies must…

机器学习 · 统计学 2026-05-26 Nikola Marković , Przemysław Sekuła , Zachary Vander Laan , Gennady Andrienko , Natalia Andrienko

On the Stability of Spherical Hellinger-Kantorovich Flows and Their Implications for Differential Privacy

Gradient-flow sampling interprets a Gibbs distribution as the minimizer of an energy functional over probability measures and generates dynamics converging to this target. Under spherical Hellinger-Kantorovich (SHK) geometry, the flow…

机器学习 · 统计学 2026-05-25 Aratrika Mustafi , Soumya Mukherjee

Move on Muon : A Hamiltonian probability gradient flow perspective of Muon optimizer

We develop a gradient flow on the space of probability measures defined on matrix-valued parameters induced by regularized Muon, an analytically smoothed version of the idealized Muon optimizer. The key observation is that the regularized…

机器学习 · 统计学 2026-05-25 Aratrika Mustafi , Soumya Mukherjee , Bharath K. Sriperumbudur

Dirichlet-Based Monte Carlo Dropout for Uncertainty Estimation in Neural Networks

Traditional neural networks provide deterministic predictions without inherent uncertainty estimates. While Bayesian Neural Networks (BNNs) offer a principled approach to uncertainty quantification, their computational complexity limits…

机器学习 · 统计学 2026-05-25 Rouaa Hoblos , Noura Dridi , Noureddine Zerhouni , Zeina Al Masry

Asymmetric Scaling Laws from Sparse Features

We introduce a model for neural scaling laws under sparse activations. In the model, test loss is often dominated by rare coordinates that are never observed in the training input. This mechanism induces a novel bottleneck absent from dense…

机器学习 · 统计学 2026-05-25 John Sous , Michael Winer

Concomitant DAG Learning: On the Roles of Noise Adaptivity, Sparsity, and Non-negativity

Directed acyclic graphs (DAGs) constitute a central modeling tool to enable principled reasoning about cause-effect interactions in complex systems. However, since the causal structure underlying a group of variables is often unknown and…

机器学习 · 统计学 2026-05-25 Gonzalo Mateos , Samuel Rey , Hamed Ajorlou , Mariano Tepper

Coupled Training with Privileged Information and Unlabeled Data

In many prediction problems, we have extra information during training (for example, measurements that are expensive or slow to collect) that will not be available when the model is deployed. A common strategy is to first train a model that…

机器学习 · 统计学 2026-05-25 Jiahao Shi , Omar Hagrass , Jason M. Klusowski

Operationalizing Individual Fairness via Gradient Descent and Bradley-Terry Models

Individual fairness, the notion that "similar individuals should be treated similarly," provides a strong and flexible fairness guarantee for algorithmic decision makers. However, a barrier to implementing individual fairness in practice is…

机器学习 · 统计学 2026-05-25 Conlan Olson , Linjun Zhang , Zhun Deng , Pragya Sur

LLM Sparsity Prior for Robust Feature Selection

Large language models (LLMs) offer a scalable mechanism to elicit domain-informed prior information for high-dimensional variable selection. However, existing methods such as LLM-Lasso are sensitive to weight quality, with performance…

机器学习 · 统计学 2026-05-25 Caleb Skinner , Yihan Guo , Meng Li

Diffusion-based Denoising Beats Vanilla Score Matching in Parameter Estimation: A Theoretical Explanation

Score matching is an alternative to maximum likelihood estimation when the normalizing constant is unknown or too costly to evaluate. However, vanilla score matching has shown to be inefficient relative to maximum likelihood estimation for…

机器学习 · 统计学 2026-05-25 Benedikt Lütke Schwienhorst , Nadja Klein , Johannes Lederer