Machine Learning - Scifaro

Generalization error property of infoGAN for two-layer neural network

Information Maximizing Generative Adversarial Network (infoGAN) can be understood as a minimax problem involving two neural networks: discriminators and generators with mutual information functions. The infoGAN incorporates various…

Machine Learning · Statistics 2025-05-23 Mahmud Hasan, Mathias Muia

Estimate-Then-Optimize versus Integrated-Estimation-Optimization versus Sample Average Approximation: A Stochastic Dominance Perspective

In data-driven stochastic optimization, model parameters of the underlying distribution need to be estimated from data in addition to the optimization task. Recent literature considers integrating the estimation and optimization processes…

Machine Learning · Statistics 2025-05-23 Adam N. Elmachtoub, Henry Lam, Haofeng Zhang, Yunfan Zhao

Are machine learning interpretations reliable? A stability study on global interpretations

As machine learning systems are increasingly used in high-stakes domains, there is a growing emphasis placed on making them interpretable to improve trust in these systems. In response, a range of interpretable machine learning (IML)…

Machine Learning · Statistics 2025-05-22 Luqin Gan, Tarek M. Zikry, Genevera I. Allen

Uncertainty Quantification in SVM prediction

This paper explores Uncertainty Quantification (UQ) in SVM predictions, particularly for regression and forecasting tasks. Unlike neural networks, SVM solutions are typically more stable, sparse, optimal, and interpretable. However,…

Machine Learning · Statistics 2025-05-22 Pritam Anand

Robust Multimodal Learning via Entropy-Gated Contrastive Fusion

Real-world multimodal systems routinely face missing-input scenarios: a robot loses audio in a factory, or a clinical record omits lab tests at inference time. Standard fusion layers either preserve robustness or calibration…

Machine Learning · Statistics 2025-05-22 Leon Chlon, Maggie Chlon, MarcAntonio M. Awada

Infinite hierarchical contrastive clustering for personal digital envirotyping

Daily environments have profound influence on our health and behavior. Recent work has shown that digital envirotyping, where computer vision is applied to images of daily environments taken during ecological momentary assessment (EMA), can…

Machine Learning · Statistics 2025-05-22 Ya-Yun Huang, Joseph McClernon, Jason A. Oliver, Matthew M. Engelhard

Convergence of Adam in Deep ReLU Networks via Directional Complexity and Kakeya Bounds

First-order adaptive optimization methods like Adam are the default choices for training modern deep neural networks. Despite their empirical success, the theoretical understanding of these methods in non-smooth settings, particularly in…

Machine Learning · Statistics 2025-05-22 Anupama Sridhar, Alexander Johansen

LOBSTUR: A Local Bootstrap Framework for Tuning Unsupervised Representations in Graph Neural Networks

Graph Neural Networks (GNNs) are increasingly used in conjunction with unsupervised learning techniques to learn powerful node representations, but their deployment is hindered by their high sensitivity to hyperparameter tuning and the…

Machine Learning · Statistics 2025-05-22 So Won Jeong, Claire Donnat

Hierarchical clustering with maximum density paths and mixture models

Hierarchical clustering is an effective, interpretable method for analyzing structure in data. It reveals insights at multiple scales without requiring a predefined number of clusters and captures nested patterns and subtle relationships,…

Machine Learning · Statistics 2025-05-22 Martin Ritzert, Polina Turishcheva, Laura Hansel, Paul Wollenhaupt, Marissa A. Weis, Alexander S. Ecker

Improving the statistical efficiency of cross-conformal prediction

Vovk (2015) introduced cross-conformal prediction, a modification of split conformal designed to improve the width of prediction sets. The method, when trained with a miscoverage rate equal to $\alpha$ and $n \gg K$, ensures a marginal…

Machine Learning · Statistics 2025-05-22 Matteo Gasparin, Aaditya Ramdas

Convergence of TD(0) under Polynomial Mixing with Nonlinear Function Approximation

Temporal Difference Learning (TD(0)) is fundamental in reinforcement learning, yet its finite-sample behavior under non-i.i.d. data and nonlinear approximation remains unknown. We provide the first high-probability, finite-sample analysis…

Machine Learning · Statistics 2025-05-22 Anupama Sridhar, Alexander Johansen

Graph Neural Networks Do Not Always Oversmooth

Graph neural networks (GNNs) have emerged as powerful tools for processing relational data in applications. However, GNNs suffer from the problem of oversmoothing, the property that the features of all nodes exponentially converge to the…

Machine Learning · Statistics 2025-05-22 Bastian Epping, Alexandre René, Moritz Helias, Michael T. Schaub

Learning Memory Kernels in Generalized Langevin Equations

We introduce a novel approach for learning memory kernels in Generalized Langevin Equations. This approach initially utilizes a regularized Prony method to estimate correlation functions from trajectory data, followed by regression over a…

Machine Learning · Statistics 2025-05-22 Quanjun Lang, Jianfeng Lu

A simple estimator of the correlation kernel matrix of a determinantal point process

The Determinantal Point Process (DPP) is a parameterized model for multivariate binary variables, characterized by a correlation kernel matrix. This paper proposes a closed form estimator of this kernel, which is particularly easy to…

Machine Learning · Statistics 2025-05-21 Christian Gouriéroux, Yang Lu

A system identification approach to clustering vector autoregressive time series

Clustering of time series based on their underlying dynamics continues to attract researchers because it assists complex system modelling. Most current time series clustering methods handle only scalar time series, treat them…

Machine Learning · Statistics 2025-05-21 Zuogong Yue, Xinyi Wang, Victor Solo

High-dimensional Nonparametric Contextual Bandit Problem

We consider the kernelized contextual bandit problem with a large feature space. This problem involves $K$ arms, and the goal of the forecaster is to maximize the cumulative rewards through learning the relationship between the contexts and…

Machine Learning · Statistics 2025-05-21 Shogo Iwazaki, Junpei Komiyama, Masaaki Imaizumi

Computational Efficiency under Covariate Shift in Kernel Ridge Regression

This paper addresses the covariate shift problem in the context of nonparametric regression within reproducing kernel Hilbert spaces (RKHSs). Covariate shift arises in supervised learning when the input distributions of the training and…

Machine Learning · Statistics 2025-05-21 Andrea Della Vecchia, Arnaud Mavakala Watusadisi, Ernesto De Vito, Lorenzo Rosasco

Randomised Optimism via Competitive Co-Evolution for Matrix Games with Bandit Feedback

Learning in games is a fundamental problem in machine learning and artificial intelligence, with numerous applications (Silver et al., 2016; Schrittwieser et al., 2020). This work investigates two-player zero-sum matrix games with an…

Machine Learning · Statistics 2025-05-21 Shishen Lin

Thompson Sampling-like Algorithms for Stochastic Rising Bandits

The stochastic rising rested bandit (SRRB) is a setting in which the arms' expected rewards increase as they are pulled. It models scenarios in which the performance of the different options grows as a result of an underlying learning process…

Machine Learning · Statistics 2025-05-21 Marco Fiandri, Alberto Maria Metelli, Francesco Trovò

Coreset selection for the Sinkhorn divergence and generic smooth divergences

We introduce CO2, an efficient algorithm to produce convexly-weighted coresets with respect to generic smooth divergences. By employing a functional Taylor expansion, we show a local equivalence between sufficiently regular losses and their…

Machine Learning · Statistics 2025-05-21 Alex Kokot, Alex Luedtke