Machine Learning - Scifaro

No Certificate for Alignment: Two Independent Impossibilities and the Pareto Frontier of Achievable Safety Guarantees

We argue that formal certification of AI alignment over open-ended or unbounded input domains is impossible under standard assumptions in computational complexity and learning theory, and characterise what remains achievable. Two…

Machine Learning · Statistics 2026-05-28 Ayushi Agarwal

Moment Matters: Mean and Variance Causal Graph Discovery from Heteroscedastic Observational Data

Heteroscedasticity -- where the variance of a variable changes with other variables -- is pervasive in real data, and elucidating why it arises from the perspective of statistical moments is crucial in scientific knowledge discovery and…

Machine Learning · Statistics 2026-05-28 Yoichi Chikahara

The Well-Tempered Classifier: Some Elementary Properties of Temperature Scaling

Temperature scaling is a simple method for controlling the uncertainty of probabilistic models. It is mostly used in two contexts: improving the calibration of classifiers and tuning the stochasticity of large language models (LLMs).…

Machine Learning · Statistics 2026-05-28 Pierre-Alexandre Mattei, Bruno Loureiro
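As a rough illustration of the idea named in the entry above, the following minimal sketch divides a logit vector by a temperature before applying the softmax. It is not taken from the paper; the function names `softmax_with_temperature` and `entropy` and the example logits are assumptions chosen for illustration only.

```python
import math


def softmax_with_temperature(logits, temperature=1.0):
    """Softmax of logits / temperature (hypothetical helper, not the paper's code)."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    # Subtract the maximum for numerical stability; the result is unchanged.
    m = max(scaled)
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy in nats, used here as a simple measure of uncertainty."""
    return -sum(p * math.log(p) for p in probs if p > 0)


# Illustrative logits (assumed values).
logits = [2.0, 1.0, 0.1]
sharp = softmax_with_temperature(logits, temperature=0.5)
base = softmax_with_temperature(logits, temperature=1.0)
flat = softmax_with_temperature(logits, temperature=5.0)
```

In this sketch a temperature below 1 concentrates probability on the largest logit and a temperature above 1 spreads it out, while the ranking of the classes stays the same.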

Corrected Samplers for Discrete Flow Models

Discrete flow models (DFMs) have been proposed to learn the data distribution on finite state spaces, offering a flexible framework as an alternative to discrete diffusion models. A line of recent work has studied samplers for discrete…

Machine Learning · Statistics 2026-05-28 Zhengyan Wan, Yidong Ouyang, Liyan Xie, Hongyuan Zha, Fang Fang, Guang Cheng

DAISI: Data Assimilation with Inverse Sampling using Stochastic Interpolants

Data assimilation (DA) is a cornerstone of scientific and engineering applications, combining model forecasts with sparse and noisy observations to estimate latent system states. Classical high-dimensional DA methods, such as the ensemble…

Machine Learning · Statistics 2026-05-28 Martin Andrae, Erik Wikingsson, So Takao, Tomas Landelius, Fredrik Lindsten

Linear Causal Representation Learning by Topological Ordering, Pruning, and Disentanglement

Causal representation learning (CRL) has garnered increasing interest from the causal inference and artificial intelligence communities due to its potential to disentangle complex data-generating mechanisms into causally interpretable latent…

Machine Learning · Statistics 2026-05-28 Hao Chen, Lin Liu, Yu Guang Wang

A Bayesian Nonparametric Perspective on Mahalanobis Distance for Out of Distribution Detection

Bayesian nonparametric methods are naturally suited to the problem of out-of-distribution (OOD) detection. However, these techniques have largely been eschewed in favor of simpler methods based on distances between pre-trained or learned…

Machine Learning · Statistics 2026-05-28 Randolph W. Linderman, Noah Cowan, Yiran Chen, Scott W. Linderman

Isometry pursuit

Isometry pursuit is a convex algorithm for identifying orthonormal column-submatrices of wide matrices. It consists of a novel normalization method followed by multitask basis pursuit. Applied to Jacobians of putative coordinate functions,…

Machine Learning · Statistics 2026-05-28 Samson Koelle, Marina Meila

Conformal Prediction for Hierarchical Data

We consider conformal prediction for multivariate data and focus on hierarchical data, where some components are linear combinations of others. Intuitively, the hierarchical structure can be leveraged to reduce the size of prediction…

Machine Learning · Statistics 2026-05-28 Guillaume Principato, Gilles Stoltz, Yvenn Amara-Ouali, Yannig Goude, Bachir Hamrouche, Jean-Michel Poggi

Learning with Importance Weighted Variational Inference

Several variational bounds involving importance weighting ideas generalize the Evidence Lower BOund (ELBO) for marginal likelihood optimization, such as the Importance-weighted Auto-Encoder (IWAE), Variational Rényi (VR) and VR-IWAE…

Machine Learning · Statistics 2026-05-28 Kamélia Daudel, François Roueff

Structure of Classifier Boundaries: Case Study for a Naive Bayes Classifier

For a Bayes classifier whose input space is a graph, we study the structure of the boundary, which comprises those points for which at least one neighbor is classified differently. The scientific setting is assignment of DNA reads produced…

Machine Learning · Statistics 2026-05-28 Alan F. Karr, Zac Bowen, Adam A. Porter, Regina Ruane

Surrogate modeling for Bayesian optimization beyond a single Gaussian process

Bayesian optimization (BO) has well-documented merits for optimizing black-box functions with an expensive evaluation cost. Such functions emerge in applications as diverse as hyperparameter tuning, drug discovery, and robotics. BO hinges…

Machine Learning · Statistics 2026-05-28 Qin Lu, Konstantinos D. Polyzos, Bingcong Li, Georgios B. Giannakis

Gaussian Process-based learning with new MCMC-based implementation of Wishart prior on correlation matrix

In probabilistic supervised learning of an input-output relationship - as a sample function of a Gaussian Process (GP) - priors are typically specified for the hyperparameters of the kernel that parametrises the covariance function of the…

Machine Learning · Statistics 2026-05-27 Kane Warrior, Dalia Chakrabarty

Causal Representation Learning for Generalisable Recommendation

Predictive models trained on observational data often fail to generalise to the distributions they encounter when deployed, especially when the training data is a product of the system being optimised. Recommender systems are a canonical…

Machine Learning · Statistics 2026-05-27 Yorgos Felekis, Michael O'Riordan, Oriol Corcoll, Ciarán M. Gilligan-Lee

Constrained Bayesian Experimental Design via Online Planning

Bayesian experimental design (BED) is a principled framework for data-efficient design of sequential experiments. However, existing BED methods are unable to adapt to dynamic constraints inherent in real-world tasks due to budget…

Machine Learning · Statistics 2026-05-27 Yujia Guo, Daolang Huang, Xinyu Zhang, Sammie Katt, Samuel Kaski, Ayush Bharti

Signal-to-Noise Ratio and Sample Size Govern Representational Alignment in Neural Networks

Neural networks are known to develop latent representations that are "aligned", namely structurally similar across networks trained with different architectures, training protocols, or training datasets. We study this phenomenon in a…

Machine Learning · Statistics 2026-05-27 Ali Hussaini Umar, Alessandro Laio

Transformers Can Learn Posterior Predictive Distributions In-Context

Prior-data fitted networks (PFNs) have recently emerged as a powerful approach for Bayesian prediction tasks, approximating the posterior predictive distribution (PPD) through in-context learning. Despite their strong empirical performance…

Machine Learning · Statistics 2026-05-27 Gyeonghun Kang, Changwoo J. Lee, Xiang Cheng

CART Random Forests as Sequential Allocation over Random Opportunity Sets: A Stochastic-Control Theory of Ensemble Risk

CART random forests are among the most widely used modern predictive methods, with well-documented empirical success. Yet, at the mechanistic level, the algorithm is often treated as a black box because of its complexity. In this paper, we…

Machine Learning · Statistics 2026-05-27 Tianxing Mei, Yingying Fan, Mingming Leng, Jinchi Lv

When Does LeJEPA Learn a World Model?

A representation that scrambles the true degrees of freedom of the world cannot support reliable planning or compositional generalization. We prove that LeJEPA (alignment plus Gaussian regularization) linearly recovers the world's latent…

Machine Learning · Statistics 2026-05-27 David Klindt, Yann LeCun, Randall Balestriero

Beyond Differences: Doubly Robust Meta-Learners for Ratio-Based Treatment Effects

When treatment effects are naturally expressed as ratios -- as in medicine, pricing, and marketing -- the ratio-based CATE $\tau(x) = E[Y|W=1,X=x] / E[Y|W=0,X=x]$ is the appropriate estimand. Yet existing estimators either impose a…

Machine Learning · Statistics 2026-05-27 Michael Fuchs, Dominik Kreiss
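To make the estimand in the entry above concrete, here is a minimal sketch of a naive plug-in estimate of $\tau(x) = E[Y|W=1,X=x] / E[Y|W=0,X=x]$ for a discrete covariate, computed as a ratio of per-stratum sample means. This is not the doubly robust meta-learner the paper proposes; the function name `plug_in_ratio_cate`, the record layout `(x, w, y)`, and the toy data are all assumptions made for illustration.

```python
from collections import defaultdict


def plug_in_ratio_cate(records):
    """Naive plug-in ratio CATE per stratum x (hypothetical helper).

    records: iterable of (x, w, y) with discrete x, treatment w in {0, 1},
    and numeric outcome y. Strata lacking either arm, or with a zero
    control mean, are skipped because the ratio is undefined there.
    """
    sums = defaultdict(lambda: [0.0, 0])  # (x, w) -> [sum of y, count]
    for x, w, y in records:
        cell = sums[(x, w)]
        cell[0] += y
        cell[1] += 1

    tau = {}
    strata = {x for (x, _w) in sums}
    for x in strata:
        treated = sums.get((x, 1))
        control = sums.get((x, 0))
        if treated is None or control is None:
            continue
        mean_treated = treated[0] / treated[1]
        mean_control = control[0] / control[1]
        if mean_control == 0:
            continue
        tau[x] = mean_treated / mean_control
    return tau


# Toy data (assumed values): stratum "a" has both arms, as does "b";
# stratum "c" has no control observations and is therefore skipped.
data = [
    ("a", 1, 6.0), ("a", 1, 4.0), ("a", 0, 2.0), ("a", 0, 3.0),
    ("b", 1, 3.0), ("b", 0, 3.0),
    ("c", 1, 1.0),
]
tau_hat = plug_in_ratio_cate(data)
```

A value of `tau_hat[x]` above 1 indicates a higher mean outcome under treatment in that stratum. The plug-in ratio carries no robustness guarantees and makes no adjustment for confounding beyond stratifying on `x`.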