机器学习 — Scifaro

Overspecified Mixture Discriminant Analysis: Exponential Convergence, Statistical Guarantees, and Remote Sensing Applications

This study explores the classification error of Mixture Discriminant Analysis (MDA) in scenarios where the number of mixture components exceeds those present in the actual data distribution, a condition known as overspecification. We use a…

机器学习 · 统计学 2025-11-03 Arman Bolatov , Alan Legg , Igor Melnykov , Amantay Nurlanuly , Maxat Tezekbayev , Zhenisbek Assylbekov

Conformal Object Detection by Sequential Risk Control

Recent advances in object detectors have led to their adoption for industrial uses. However, their deployment in safety-critical applications is hindered by the inherent lack of reliability of neural networks and the complex structure of…

机器学习 · 统计学 2025-11-03 Léo andéol , Luca Mossina , Adrien Mazoyer , Sébastien Gerchinovitz

On the Tunability of Random Survival Forests Model for Predictive Maintenance

This paper investigates the tunability of the Random Survival Forest (RSF) model in predictive maintenance, where accurate time-to-failure estimation is crucial. Although RSF is widely used due to its flexibility and ability to handle…

机器学习 · 统计学 2025-11-03 Yigitcan Yardımcı , Mustafa Cavus

A Practical Introduction to Kernel Discrepancies: MMD, HSIC & KSD

This article provides a practical introduction to kernel discrepancies, focusing on the Maximum Mean Discrepancy (MMD), the Hilbert-Schmidt Independence Criterion (HSIC), and the Kernel Stein Discrepancy (KSD). Various estimators for these…

机器学习 · 统计学 2025-11-03 Antonin Schrab

DO-IQS: Dynamics-Aware Offline Inverse Q-Learning for Optimal Stopping with Unknown Gain Functions

We consider the Inverse Optimal Stopping (IOS) problem where, based on stopped expert trajectories, one aims to recover the optimal stopping region through the continuation and stopping gain functions approximation. The uniqueness of the…

机器学习 · 统计学 2025-11-03 Anna Kuchko

Generative Adversarial Networks for High-Dimensional Item Factor Analysis: A Deep Adversarial Learning Algorithm

Advances in deep learning and representation learning have transformed item factor analysis (IFA) in the item response theory (IRT) literature by enabling more efficient and accurate parameter estimation. Variational Autoencoders (VAEs)…

机器学习 · 统计学 2025-11-03 Nanyu Luo , Feng Ji

Supervised Quadratic Feature Analysis: Information Geometry Approach for Dimensionality Reduction

Supervised dimensionality reduction maps labeled data into a low-dimensional feature space while preserving class discriminability. A common approach is to maximize a statistical measure of dissimilarity between classes in the feature…

机器学习 · 统计学 2025-11-03 Daniel Herrera-Esposito , Johannes Burge

Deep learning joint extremes of metocean variables using the SPAR model

This paper presents a novel deep learning framework for estimating multivariate joint extremes of metocean variables, based on the Semi-Parametric Angular-Radial (SPAR) model. When considered in polar coordinates, the problem of modelling…

机器学习 · 统计学 2025-11-03 Ed Mackay , Callum Murphy-Barltrop , Jordan Richards , Philip Jonathan

A Unified Theory for Causal Inference: Direct Debiased Machine Learning via Bregman-Riesz Regression

This note introduces a unified theory for causal inference that integrates Riesz regression, covariate balancing, density-ratio estimation (DRE), targeted maximum likelihood estimation (TMLE), and the matching estimator in average treatment…

机器学习 · 统计学 2025-10-31 Masahiro Kato

Assessment of the conditional exchangeability assumption in causal machine learning models: a simulation study

Observational studies developing causal machine learning (ML) models for the prediction of individualized treatment effects (ITEs) seldom conduct empirical evaluations to assess the conditional exchangeability assumption. We aimed to…

机器学习 · 统计学 2025-10-31 Gerard T. Portela , Jason B. Gibbons , Sebastian Schneeweiss , Rishi J. Desai

Action-Driven Processes for Continuous-Time Control

At the heart of reinforcement learning are actions -- decisions made in response to observations of the environment. Actions are equally fundamental in the modeling of stochastic processes, as they trigger discontinuous state transitions…

机器学习 · 统计学 2025-10-31 Ruimin He , Shaowei Lin

Multi-Output Robust and Conjugate Gaussian Processes

Multi-output Gaussian process (MOGP) regression allows modelling dependencies among multiple correlated response variables. Similarly to standard Gaussian processes, MOGPs are sensitive to model misspecification and outliers, which can…

机器学习 · 统计学 2025-10-31 Joshua Rooijakkers , Leiv Rønneberg , François-Xavier Briol , Jeremias Knoblauch , Matias Altamirano

Uncertainty-Aware Diagnostics for Physics-Informed Machine Learning

Physics-informed machine learning (PIML) integrates prior physical information, often in the form of differential equation constraints, into the process of fitting machine learning models to physical data. Popular PIML approaches, including…

机器学习 · 统计学 2025-10-31 Mara Daniels , Liam Hodgkinson , Michael Mahoney

Data-driven Projection Generation for Efficiently Solving Heterogeneous Quadratic Programming Problems

We propose a data-driven framework for efficiently solving quadratic programming (QP) problems by reducing the number of variables in high-dimensional QPs using instance-specific projection. A graph neural network-based model is designed to…

机器学习 · 统计学 2025-10-31 Tomoharu Iwata , Futoshi Futami

$L_1$-norm Regularized Indefinite Kernel Logistic Regression

Kernel logistic regression (KLR) is a powerful classification method widely applied across diverse domains. In many real-world scenarios, indefinite kernels capture more domain-specific structural information than positive definite kernels.…

机器学习 · 统计学 2025-10-31 Shaoxin Wang , Hanjing Yao

Conformal Prediction Beyond the Horizon: Distribution-Free Inference for Policy Evaluation

Reliable uncertainty quantification is crucial for reinforcement learning (RL) in high-stakes settings. We propose a unified conformal prediction framework for infinite-horizon policy evaluation that constructs distribution-free prediction…

机器学习 · 统计学 2025-10-31 Feichen Gan , Youcun Lu , Yingying Zhang , Yukun Liu

Multimodal Bandits: Regret Lower Bounds and Optimal Algorithms

We consider a stochastic multi-armed bandit problem with i.i.d. rewards where the expected reward function is multimodal with at most m modes. We propose the first known computationally tractable algorithm for computing the solution to the…

机器学习 · 统计学 2025-10-31 William Réveillard , Richard Combes

Optimal Online Change Detection via Random Fourier Features

This article studies the problem of online non-parametric change point detection in multivariate data streams. We approach the problem through the lens of kernel-based two-sample testing and introduce a sequential testing procedure based on…

机器学习 · 统计学 2025-10-31 Florian Kalinke , Shakeel Gavioli-Akilagun

On the Impact of Performative Risk Minimization for Binary Random Variables

Performativity, the phenomenon where outcomes are influenced by predictions, is particularly prevalent in social contexts where individuals strategically respond to a deployed model. In order to preserve the high accuracy of machine…

机器学习 · 统计学 2025-10-31 Nikita Tsoy , Ivan Kirev , Negin Rahimiyazdi , Nikola Konstantinov

Beyond likelihood ratio bias: Nested multi-time-scale stochastic approximation for likelihood-free parameter estimation

We study parameter inference in simulation-based stochastic models where the analytical form of the likelihood is unknown. The main difficulty is that score evaluation as a ratio of noisy Monte Carlo estimators induces bias and instability,…

机器学习 · 统计学 2025-10-31 Zehao Li , Zhouchen Lin , Yijie Peng