Machine Learning - Scifaro

Compositional Generation for Long-Horizon Coupled PDEs

Simulating coupled PDE systems is computationally intensive, and prior efforts have largely focused on training surrogates on the joint (coupled) data, which requires a large amount of data. In this paper, we study compositional diffusion…

Machine Learning · Statistics 2025-10-24 Somayajulu L. N. Dhulipala, Deep Ray, Nicholas Forman

Enhanced Cyclic Coordinate Descent Methods for Elastic Net Penalized Linear Models

We present a novel enhanced cyclic coordinate descent (ECCD) framework for solving generalized linear models with elastic net constraints that reduces training time in comparison to existing state-of-the-art methods. We redesign the CD…

Machine Learning · Statistics 2025-10-24 Yixiao Wang, Zishan Shao, Ting Jiang, Aditya Devarakonda

Certified Self-Consistency: Statistical Guarantees and Test-Time Training for Reliable Reasoning in LLMs

Recent advances such as self-consistency and test-time reinforcement learning (TTRL) improve the reliability of large language models (LLMs) without additional supervision, yet their underlying mechanisms and statistical guarantees remain…

Machine Learning · Statistics 2025-10-24 Paula Cordero-Encinar, Andrew B. Duncan

Optimal Dynamic Regret by Transformers for Non-Stationary Reinforcement Learning

Transformers have demonstrated exceptional performance across a wide range of domains. While their ability to perform reinforcement learning in-context has been established both theoretically and empirically, their behavior in…

Machine Learning · Statistics 2025-10-24 Baiyuan Chen, Shinji Ito, Masaaki Imaizumi

A Neural Difference-of-Entropies Estimator for Mutual Information

Estimating Mutual Information (MI), a key measure of dependence of random quantities without specific modelling assumptions, is a challenging problem in high dimensions. We propose a novel mutual information estimator based on parametrizing…

Machine Learning · Statistics 2025-10-24 Haoran Ni, Martin Lotz

Sample-efficient Learning of Concepts with Theoretical Guarantees: from Data to Concepts without Interventions

Machine learning is a vital part of many real-world systems, but several concerns remain about the lack of interpretability, explainability and robustness of black-box AI systems. Concept Bottleneck Models (CBM) address some of these…

Machine Learning · Statistics 2025-10-24 Hidde Fokkema, Tim van Erven, Sara Magliacane

Statistical Inference for Generative Model Comparison

Generative models have achieved remarkable success across a range of applications, yet their evaluation still lacks principled uncertainty quantification. In this paper, we develop a method for comparing how close different generative…

Machine Learning · Statistics 2025-10-24 Zijun Gao, Yan Sun, Han Su

Deep Continuous-Time State-Space Models for Marked Event Sequences

Marked temporal point processes (MTPPs) model sequences of events occurring at irregular time intervals, with wide-ranging applications in fields such as healthcare, finance and social networks. We propose the state-space point process…

Machine Learning · Statistics 2025-10-24 Yuxin Chang, Alex Boyd, Cao Xiao, Taha Kass-Hout, Parminder Bhatia, Padhraic Smyth, Andrew Warrington

Stochastic gradient descent in high dimensions for multi-spiked tensor PCA

We study the high-dimensional dynamics of online stochastic gradient descent (SGD) for the multi-spiked tensor model. This multi-index model arises from the tensor principal component analysis (PCA) problem with multiple spikes, where the…

Machine Learning · Statistics 2025-10-24 Gérard Ben Arous, Cédric Gerbelot, Vanessa Piccolo

On the Robustness of Kernel Goodness-of-Fit Tests

Goodness-of-fit testing is often criticized for its lack of practical relevance: since "all models are wrong", the null hypothesis that the data conform to our model is ultimately always rejected as the sample size grows. Despite this,…

Machine Learning · Statistics 2025-10-24 Xing Liu, François-Xavier Briol

Causal Post-Processing of Predictive Models

Organizations increasingly rely on predictive models to decide who should be targeted for interventions, such as marketing campaigns, customer retention offers, or medical treatments. Yet these models are usually built to predict outcomes…

Machine Learning · Statistics 2025-10-24 Carlos Fernández-Loría, Yanfang Hou, Foster Provost, Jennifer Hill

Learning Upper Lower Value Envelopes to Shape Online RL: A Principled Approach

We investigate the fundamental problem of leveraging offline data to accelerate online reinforcement learning - a direction with strong potential but limited theoretical grounding. Our study centers on how to learn and apply value envelopes…

Machine Learning · Statistics 2025-10-23 Sebastian Reboul, Hélène Halconruy, Randal Douc

Square root Cox's survival analysis by the fittest linear and neural networks model

We revisit Cox's proportional hazard models and LASSO with the aim of improving feature selection in survival analysis. Unlike traditional methods relying on cross-validation or BIC, the penalty parameter $\lambda$ is directly tuned for…

Machine Learning · Statistics 2025-10-23 Maxime van Cutsem, Sylvain Sardy

Metadata Extraction Leveraging Large Language Models

The advent of Large Language Models has revolutionized tasks across domains, including the automation of legal document analysis, a critical component of modern contract management systems. This paper presents a comprehensive implementation…

Machine Learning · Statistics 2025-10-23 Cuize Han, Sesh Jalagam

Topology of Currencies: Persistent Homology for FX Co-movements: A Comparative Clustering Study

This study investigates whether Topological Data Analysis (TDA) can provide additional insights beyond traditional statistical methods in clustering currency behaviours. We focus on the foreign exchange (FX) market, which is a complex…

Machine Learning · Statistics 2025-10-23 Pattravadee de Favereau de Jeneret, Ioannis Diamantis

Extreme Event Aware ($\eta$-) Learning

Quantifying and predicting rare and extreme events persists as a crucial yet challenging task in understanding complex dynamical systems. Many practical challenges arise from the infrequency and severity of these events, including the…

Machine Learning · Statistics 2025-10-23 Kai Chang, Themistoklis P. Sapsis

From Reviews to Actionable Insights: An LLM-Based Approach for Attribute and Feature Extraction

This research proposes a systematic, large language model (LLM) approach for extracting product and service attributes, features, and associated sentiments from customer reviews. Grounded in marketing theory, the framework distinguishes…

Machine Learning · Statistics 2025-10-23 Khaled Boughanmi, Kamel Jedidi, Nour Jedidi

The Coverage Principle: How Pre-Training Enables Post-Training

Language models demonstrate remarkable abilities when pre-trained on large text corpora and fine-tuned for specific tasks, but how and why pre-training shapes the success of the final model remains poorly understood. Notably, although…

Machine Learning · Statistics 2025-10-23 Fan Chen, Audrey Huang, Noah Golowich, Sadhika Malladi, Adam Block, Jordan T. Ash, Akshay Krishnamurthy, Dylan J. Foster

Adjustment for Confounding using Pre-Trained Representations

There is growing interest in extending average treatment effect (ATE) estimation to incorporate non-tabular data, such as images and text, which may act as sources of confounding. Neglecting these effects risks biased results and flawed…

Machine Learning · Statistics 2025-10-23 Rickmer Schulte, David Rügamer, Thomas Nagler

Non-Stationary Lipschitz Bandits

We study the problem of non-stationary Lipschitz bandits, where the number of actions is infinite and the reward function, satisfying a Lipschitz assumption, can change arbitrarily over time. We design an algorithm that adaptively tracks…

Machine Learning · Statistics 2025-10-23 Nicolas Nguyen, Solenne Gaucher, Claire Vernade