Machine Learning - Scifaro

Out-of-Sample Embedding with Proximity Data: Projection versus Restricted Reconstruction

The problem of using proximity (similarity or dissimilarity) data for the purpose of "adding a point to a vector diagram" was first studied by J.C. Gower in 1968. Since then, a number of methods -- mostly kernel methods -- have been…

Machine Learning · Statistics 2025-05-13 Michael W. Trosset, Kaiyi Tan, Minh Tang, Carey E. Priebe

Learning Guarantee of Reward Modeling Using Deep Neural Networks

In this work, we study the learning theory of reward modeling with pairwise comparison data using deep neural networks. We establish a novel non-asymptotic regret bound for deep reward estimators in a non-parametric setting, which depends…

Machine Learning · Statistics 2025-05-13 Yuanhang Luo, Yeheng Ge, Ruijian Han, Guohao Shen

Optimal Transport for Machine Learners

Optimal Transport is a foundational mathematical theory that connects optimization, partial differential equations, and probability. It offers a powerful framework for comparing probability distributions and has recently become an important…

Machine Learning · Statistics 2025-05-13 Gabriel Peyré

High-Dimensional Importance-Weighted Information Criteria: Theory and Optimality

Imori and Ing (2025) proposed the importance-weighted orthogonal greedy algorithm (IWOGA) for model selection in high-dimensional misspecified regression models under covariate shift. To determine the number of IWOGA iterations, they…

Machine Learning · Statistics 2025-05-13 Yong-Syun Cao, Shinpei Imori, Ching-Kang Ing

Fair Representation Learning for Continuous Sensitive Attributes using Expectation of Integral Probability Metrics

AI fairness, also known as algorithmic fairness, aims to ensure that algorithms operate without bias or discrimination towards any individual or group. Among various AI algorithms, the Fair Representation Learning (FRL) approach has gained…

Machine Learning · Statistics 2025-05-13 Insung Kong, Kunwoong Kim, Yongdai Kim

Inverse Covariance and Partial Correlation Matrix Estimation via Joint Partial Regression

We present a method for estimating sparse high-dimensional inverse covariance and partial correlation matrices, which exploits the connection between the inverse covariance matrix and linear regression. The method is a two-stage estimation…

Machine Learning · Statistics 2025-05-13 Samuel Erickson, Tobias Rydén

Investigating the Impact of Balancing, Filtering, and Complexity on Predictive Multiplicity: A Data-Centric Perspective

The Rashomon effect presents a significant challenge in model selection. It occurs when multiple models achieve similar performance on a dataset but produce different predictions, resulting in predictive multiplicity. This is especially…

Machine Learning · Statistics 2025-05-13 Mustafa Cavus, Przemyslaw Biecek

Discrete distributions are learnable from metastable samples

Physically motivated stochastic dynamics are often used to sample from high-dimensional distributions. However, such dynamics often get stuck in specific regions of their state space and mix very slowly to the desired stationary state. This…

Machine Learning · Statistics 2025-05-13 Abhijith Jayakumar, Andrey Y. Lokhov, Sidhant Misra, Marc Vuffray

Transformers Handle Endogeneity in In-Context Linear Regression

We explore the capability of transformers to address endogeneity in in-context linear regression. Our main finding is that transformers inherently possess a mechanism to handle endogeneity effectively using instrumental variables (IV).…

Machine Learning · Statistics 2025-05-13 Haodong Liang, Krishnakumar Balasubramanian, Lifeng Lai

Statistical Error Bounds for GANs with Nonlinear Objective Functionals

Generative adversarial networks (GANs) are unsupervised learning methods for training a generator distribution to produce samples that approximate those drawn from a target distribution. Many such methods can be formulated as minimization…

Machine Learning · Statistics 2025-05-13 Jeremiah Birrell

Demystifying SGD with Doubly Stochastic Gradients

Optimization objectives in the form of a sum of intractable expectations are rising in importance (e.g., diffusion models, variational autoencoders, and many more), a setting also known as "finite sum with infinite data." For these…

Machine Learning · Statistics 2025-05-13 Kyurae Kim, Joohwan Ko, Yi-An Ma, Jacob R. Gardner

Towards One Model for Classical Dimensionality Reduction: A Probabilistic Perspective on UMAP and t-SNE

This paper shows that dimensionality reduction methods such as UMAP and t-SNE can be approximately recast as MAP inference methods corresponding to a model introduced in Ravuri et al. (2023), that describes the graph Laplacian (an estimate…

Machine Learning · Statistics 2025-05-13 Aditya Ravuri, Neil D. Lawrence

On Kernel-based Variational Autoencoder

In this paper, we bridge Variational Autoencoders (VAEs) and kernel density estimations (KDEs) by approximating the posterior by KDEs and deriving an upper bound of the Kullback-Leibler (KL) divergence in the evidence lower bound (ELBO).…

Machine Learning · Statistics 2025-05-13 Tian Qin, Wei-Min Huang

Privacy of SGD under Gaussian or Heavy-Tailed Noise: Guarantees without Gradient Clipping

The injection of heavy-tailed noise into the iterates of stochastic gradient descent (SGD) has garnered growing interest in recent years due to its theoretical and empirical benefits for optimization and generalization. However, its…

Machine Learning · Statistics 2025-05-13 Umut Şimşekli, Mert Gürbüzbalaban, Sinan Yıldırım, Lingjiong Zhu

Nonlinear functional regression by functional deep neural network with kernel embedding

Recently, deep learning has been widely applied in functional data analysis (FDA) with notable empirical success. However, the infinite dimensionality of functional data necessitates an effective dimension reduction approach for functional…

Machine Learning · Statistics 2025-05-13 Zhongjie Shi, Jun Fan, Linhao Song, Ding-Xuan Zhou, Johan A. K. Suykens

Extreme Scenario Selection in Day-Ahead Power Grid Operational Planning

We propose and analyze the application of statistical functional depth metrics for the selection of extreme scenarios in day-ahead grid planning. Our primary motivation is screening of probabilistic scenarios for realized load and renewable…

Machine Learning · Statistics 2025-05-13 Guillermo Terrén-Serrano, Michael Ludkovski

Mode and Ridge Estimation in Euclidean and Directional Product Spaces: A Mean Shift Approach

The set of local modes and density ridge lines are important summary characteristics of the data-generating distribution. In this work, we focus on estimating local modes and density ridges from point cloud data in a product space combining…

Machine Learning · Statistics 2025-05-13 Yikun Zhang, Yen-Chi Chen

Linear Convergence of the Subspace Constrained Mean Shift Algorithm: From Euclidean to Directional Data

This paper studies the linear convergence of the subspace constrained mean shift (SCMS) algorithm, a well-known algorithm for identifying a density ridge defined by a kernel density estimator. By arguing that the SCMS algorithm is a special…

Machine Learning · Statistics 2025-05-13 Yikun Zhang, Yen-Chi Chen

Optimal Regret of Bernoulli Bandits under Global Differential Privacy

As sequential learning algorithms are increasingly applied to real life, ensuring data privacy while maintaining their utilities emerges as a timely question. In this context, regret minimisation in stochastic bandits under…

Machine Learning · Statistics 2025-05-12 Achraf Azize, Yulian Wu, Junya Honda, Francesco Orabona, Shinji Ito, Debabrota Basu

On expected signatures and signature cumulants in semimartingale models

The concept of signatures and expected signatures is vital in data science, especially for sequential data analysis. The signature transform, a Cartan type development, translates paths into high-dimensional feature vectors, capturing their…

Machine Learning · Statistics 2025-05-12 Peter K. Friz, Paul P. Hager, Nikolas Tapia