机器学习 — Scifaro

Learning causal graphs using variable grouping according to ancestral relationship

Several causal discovery algorithms have been proposed. However, when the sample size is small relative to the number of variables, the accuracy of estimating causal graphs using existing methods decreases. And some methods are not feasible…

机器学习 · 统计学 2025-10-06 Ming Cai , Hisayuki Hara

Post Reinforcement Learning Inference

We study estimation and inference using data collected by reinforcement learning (RL) algorithms. These algorithms adaptively experiment by interacting with individual units over multiple stages, updating their strategies based on past…

机器学习 · 统计学 2025-10-06 Vasilis Syrgkanis , Ruohan Zhan

Diffusion Posterior Sampling for General Noisy Inverse Problems

Diffusion models have been recently studied as powerful generative inverse problem solvers, owing to their high quality reconstructions and the ease of combining existing iterative solvers. However, most works focus on solving simple linear…

机器学习 · 统计学 2025-10-06 Hyungjin Chung , Jeongsol Kim , Michael T. Mccann , Marc L. Klasky , Jong Chul Ye

Discrimination in machine learning algorithms

Machine learning algorithms are routinely used for business decisions that may directly affect individuals, for example, because a credit scoring algorithm refuses them a loan. It is then relevant from an ethical (and legal) point of view…

机器学习 · 统计学 2025-10-06 Roberta Pappadà , Francesco Pauli

Hybrid Physics-ML Framework for Pan-Arctic Permafrost Infrastructure Risk at Record 2.9-Million Observation Scale

Arctic warming threatens over 100 billion in permafrost-dependent infrastructure across Northern territories, yet existing risk assessment frameworks lack spatiotemporal validation, uncertainty quantification, and operational…

机器学习 · 统计学 2025-10-03 Boris Kriuk

Uniform-in-time convergence bounds for Persistent Contrastive Divergence Algorithms

We propose a continuous-time formulation of persistent contrastive divergence (PCD) for maximum likelihood estimation (MLE) of unnormalised densities. Our approach expresses PCD as a coupled, multiscale system of stochastic differential…

机器学习 · 统计学 2025-10-03 Paul Felix Valsecchi Oliva , O. Deniz Akyildiz , Andrew Duncan

A reproducible comparative study of categorical kernels for Gaussian process regression, with new clustering-based nested kernels

Designing categorical kernels is a major challenge for Gaussian process regression with continuous and categorical inputs. Despite previous studies, it is difficult to identify a preferred method, either because the evaluation metrics, the…

机器学习 · 统计学 2025-10-03 Raphaël Carpintero Perez , Sébastien Da Veiga , Josselin Garnier

AI Foundation Model for Time Series with Innovations Representation

This paper introduces an Artificial Intelligence (AI) foundation model for time series in engineering applications, where causal operations are required for real-time monitoring and control. Since engineering time series are governed by…

机器学习 · 统计学 2025-10-03 Lang Tong , Xinyi Wang

Risk Phase Transitions in Spiked Regression: Alignment Driven Benign and Catastrophic Overfitting

This paper analyzes the generalization error of minimum-norm interpolating solutions in linear regression using spiked covariance data models. The paper characterizes how varying spike strengths and target-spike alignments can affect risk,…

机器学习 · 统计学 2025-10-03 Jiping Li , Rishi Sonthalia

Continuously Augmented Discrete Diffusion model for Categorical Generative Modeling

Standard discrete diffusion models treat all unobserved states identically by mapping them to an absorbing [MASK] token. This creates an 'information void' where semantic information that could be inferred from unmasked tokens is lost…

机器学习 · 统计学 2025-10-03 Huangjie Zheng , Shansan Gong , Ruixiang Zhang , Tianrong Chen , Jiatao Gu , Mingyuan Zhou , Navdeep Jaitly , Yizhe Zhang

Private Realizable-to-Agnostic Transformation with Near-Optimal Sample Complexity

The realizable-to-agnostic transformation (Beimel et al., 2015; Alon et al., 2020) provides a general mechanism to convert a private learner in the realizable setting (where the examples are labeled by some function in the concept class) to…

机器学习 · 统计学 2025-10-03 Bo Li , Wei Wang , Peng Ye

WWAggr: A Window Wasserstein-based Aggregation for Ensemble Change Point Detection

Change Point Detection (CPD) aims to identify moments of abrupt distribution shifts in data streams. Real-world high-dimensional CPD remains challenging due to data pattern complexity and violation of common assumptions. Resorting to…

机器学习 · 统计学 2025-10-03 Alexander Stepikin , Evgenia Romanenkova , Alexey Zaytsev

SIM-Shapley: A Stable and Computationally Efficient Approach to Shapley Value Approximation

Explainable artificial intelligence (XAI) is essential for trustworthy machine learning (ML), particularly in high-stakes domains such as healthcare and finance. Shapley value (SV) methods provide a principled framework for feature…

机器学习 · 统计学 2025-10-03 Wangxuan Fan , Siqi Li , Doudou Zhou , Yohei Okada , Chuan Hong , Molei Liu , Nan Liu

Policy-Oriented Binary Classification: Improving (KD-)CART Final Splits for Subpopulation Targeting

Policymakers often use recursive binary split rules to partition populations based on binary outcomes and target subpopulations whose probability of the binary event exceeds a threshold. We call such problems Latent Probability…

机器学习 · 统计学 2025-10-03 Lei Bill Wang , Zhenbang Jiao , Fangyi Wang

Using matrix-product states for time-series machine learning

Matrix-product states (MPS) have proven to be a versatile ansatz for modeling quantum many-body physics. For many applications, and particularly in one-dimension, they capture relevant quantum correlations in many-body wavefunctions while…

机器学习 · 统计学 2025-10-03 Joshua B. Moore , Hugo P. Stackhouse , Ben D. Fulcher , Sahand Mahmoodian

Neural Network Parameter-optimization of Gaussian pmDAGs

Finding the parameters of a latent variable causal model is central to causal inference and causal identification. In this article, we show that existing graphical structures that are used in causal inference are not stable under…

机器学习 · 统计学 2025-10-03 Mehrzad Saremi

Convergence analysis of online algorithms for vector-valued kernel regression

We consider the problem of approximating the regression function $f_\mu:\, \Omega \to Y$ from noisy $\mu$-distributed vector-valued data $(\omega_m,y_m)\in\Omega\times Y$ by an online learning algorithm using a reproducing kernel Hilbert…

机器学习 · 统计学 2025-10-03 Michael Griebel , Peter Oswald

Theory of Scaling Laws for In-Context Regression: Depth, Width, Context and Time

We study in-context learning (ICL) of linear regression in a deep linear self-attention model, characterizing how performance depends on various computational and statistical resources (width, depth, number of training steps, batch size and…

机器学习 · 统计学 2025-10-02 Blake Bordelon , Mary I. Letey , Cengiz Pehlevan

Optimal placement of wind farms via quantile constraint learning

Wind farm placement arranges the size and the location of multiple wind farms within a given region. The power output is highly related to the wind speed on spatial and temporal levels, which can be modeled by advanced data-driven…

机器学习 · 统计学 2025-10-02 Wenxiu Feng , Antonio Alcántara , Carlos Ruiz

Approximation of differential entropy in Bayesian optimal experimental design

Bayesian optimal experimental design provides a principled framework for selecting experimental settings that maximize obtained information. In this work, we focus on estimating the expected information gain in the setting where the…

机器学习 · 统计学 2025-10-02 Chuntao Chen , Tapio Helin , Nuutti Hyvönen , Yuya Suzuki