机器学习 — Scifaro

Phase transition of \emph{descending} phase retrieval algorithms

We study theoretical limits of \emph{descending} phase retrieval algorithms. Utilizing \emph{Random duality theory} (RDT) we develop a generic program that allows statistical characterization of various algorithmic performance metrics.…

机器学习 · 统计学 2025-06-24 Mihailo Stojnic

Identifiable Convex-Concave Regression via Sub-gradient Regularised Least Squares

We propose a novel nonparametric regression method that models complex input-output relationships as the sum of convex and concave components. The method-Identifiable Convex-Concave Nonparametric Least Squares (ICCNLS)-decomposes the target…

机器学习 · 统计学 2025-06-24 William Chung

Gaussian Processes and Reproducing Kernels: Connections and Equivalences

This monograph studies the relations between two approaches using positive definite kernels: probabilistic methods using Gaussian processes, and non-probabilistic methods using reproducing kernel Hilbert spaces (RKHS). They are widely…

机器学习 · 统计学 2025-06-24 Motonobu Kanagawa , Philipp Hennig , Dino Sejdinovic , Bharath K. Sriperumbudur

Differentiable neural network representation of multi-well, locally-convex potentials

Multi-well potentials are ubiquitous in science, modeling phenomena such as phase transitions, dynamic instabilities, and multimodal behavior across physics, chemistry, and biology. In contrast to non-smooth minimum-of-mixture…

机器学习 · 统计学 2025-06-24 Reese E. Jones , Adrian Buganza Tepole , Jan N. Fuhg

Fast Bayesian Optimization of Function Networks with Partial Evaluations

Bayesian optimization of function networks (BOFN) is a framework for optimizing expensive-to-evaluate objective functions structured as networks, where some nodes' outputs serve as inputs for others. Many real-world applications, such as…

机器学习 · 统计学 2025-06-24 Poompol Buathong , Peter I. Frazier

When to Forget? Complexity Trade-offs in Machine Unlearning

Machine Unlearning (MU) aims at removing the influence of specific data points from a trained model, striving to achieve this at a fraction of the cost of full model retraining. In this paper, we analyze the efficiency of unlearning methods…

机器学习 · 统计学 2025-06-24 Martin Van Waerebeke , Marco Lorenzi , Giovanni Neglia , Kevin Scaman

LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large Language Models, Provably and Efficiently

This paper explores how theory can guide and enhance practical algorithms, using Low-Rank Adaptation (LoRA, Hu et al. 2022) in large language models as a case study. We rigorously prove that, under gradient descent, LoRA adapters align with…

机器学习 · 统计学 2025-06-24 Yuanhe Zhang , Fanghui Liu , Yudong Chen

Persistent Sampling: Enhancing the Efficiency of Sequential Monte Carlo

Sequential Monte Carlo (SMC) samplers are powerful tools for Bayesian inference but suffer from high computational costs due to their reliance on large particle ensembles for accurate estimates. We introduce persistent sampling (PS), an…

机器学习 · 统计学 2025-06-24 Minas Karamanis , Uroš Seljak

A generalized neural tangent kernel for surrogate gradient learning

State-of-the-art neural network training methods depend on the gradient of the network function. Therefore, they cannot be applied to networks whose activation functions do not have useful derivatives, such as binary and discrete-time…

机器学习 · 统计学 2025-06-24 Luke Eilers , Raoul-Martin Memmesheimer , Sven Goedeke

A Bayesian Non-parametric Approach to Generative Models: Integrating Variational Autoencoder and Generative Adversarial Networks using Wasserstein and Maximum Mean Discrepancy

We propose a novel generative model within the Bayesian non-parametric learning (BNPL) framework to address some notable failure modes in generative adversarial networks (GANs) and variational autoencoders (VAEs)--these being overfitting in…

机器学习 · 统计学 2025-06-24 Forough Fazeli-Asl , Michael Minyi Zhang

Controlling Moments with Kernel Stein Discrepancies

Kernel Stein discrepancies (KSDs) measure the quality of a distributional approximation and can be computed even when the target density has an intractable normalizing constant. Notable applications include the diagnosis of approximate MCMC…

机器学习 · 统计学 2025-06-24 Heishiro Kanagawa , Alessandro Barp , Arthur Gretton , Lester Mackey

Uncertainty-aware Efficient Subgraph Isomorphism using Graph Topology

Subgraph isomorphism, also known as subgraph matching, is typically regarded as an NP-complete problem. This complexity is further compounded in practical applications where edge weights are real-valued and may be affected by measurement…

机器学习 · 统计学 2025-06-24 Arpan Kusari , Wenbo Sun

Latent Noise Injection for Private and Statistically Aligned Synthetic Data Generation

Synthetic Data Generation has become essential for scalable, privacy-preserving statistical analysis. While standard approaches based on generative models, such as Normalizing Flows, have been widely used, they often suffer from slow…

机器学习 · 统计学 2025-06-23 Rex Shen , Lu Tian

On Continuous Monitoring of Risk Violations under Unknown Shift

Machine learning systems deployed in the real world must operate under dynamic and often unpredictable distribution shifts. This challenges the validity of statistical safety assurances on the system's risk established beforehand. Common…

机器学习 · 统计学 2025-06-23 Alexander Timans , Rajeev Verma , Eric Nalisnick , Christian A. Naesseth

Random feature approximation for general spectral methods

Random feature approximation is arguably one of the most widely used techniques for kernel methods in large-scale learning algorithms. In this work, we analyze the generalization properties of random feature methods, extending previous…

机器学习 · 统计学 2025-06-23 Mike Nguyen , Nicole Mücke

CP$^2$: Leveraging Geometry for Conformal Prediction via Canonicalization

We study the problem of conformal prediction (CP) under geometric data shifts, where data samples are susceptible to transformations such as rotations or flips. While CP endows prediction models with post-hoc uncertainty quantification and…

机器学习 · 统计学 2025-06-23 Putri A. van der Linden , Alexander Timans , Erik J. Bekkers

Diffusion-Based Hypothesis Testing and Change-Point Detection

Score-based methods have recently seen increasing popularity in modeling and generation. Methods have been constructed to perform hypothesis testing and change-point detection with score functions, but these methods are in general not as…

机器学习 · 统计学 2025-06-23 Sean Moushegian , Taposh Banerjee , Vahid Tarokh

From Local Interactions to Global Operators: Scalable Gaussian Process Operator for Physical Systems

Operator learning offers a powerful paradigm for solving parametric partial differential equations (PDEs), but scaling probabilistic neural operators such as the recently proposed Gaussian Processes Operators (GPOs) to high-dimensional,…

机器学习 · 统计学 2025-06-23 Sawan Kumar , Tapas Tripura , Rajdip Nayek , Souvik Chakraborty

Sampling conditioned diffusions via Pathspace Projected Monte Carlo

We present an algorithm to sample stochastic differential equations conditioned on rather general constraints, including integral constraints, endpoint constraints, and stochastic integral constraints. The algorithm is a pathspace…

机器学习 · 统计学 2025-06-23 Tobias Grafke

Uniform Mean Estimation for Heavy-Tailed Distributions via Median-of-Means

The Median of Means (MoM) is a mean estimator that has gained popularity in the context of heavy-tailed data. In this work, we analyze its performance in the task of simultaneously estimating the mean of each function in a class…

机器学习 · 统计学 2025-06-23 Mikael Møller Høgsgaard , Andrea Paudice