Related papers: Trade-off Between Dependence and Complexity for No…

Statistical Learning under Nonstationary Mixing Processes

We study a special case of the problem of statistical learning without the i.i.d. assumption. Specifically, we suppose a learning method is presented with a sequence of data points, and required to make a prediction (e.g., a classification)…

Machine Learning · Computer Science 2018-05-22 Steve Hanneke, Liu Yang

Theory and Algorithms for Forecasting Time Series

We present data-dependent learning bounds for the general scenario of non-stationary non-mixing stochastic processes. Our learning guarantees are expressed in terms of a data-dependent measure of sequential complexity and a discrepancy…

Machine Learning · Computer Science 2018-03-16 Vitaly Kuznetsov, Mehryar Mohri

Generalization bounds for learning under graph-dependence: A survey

Traditional statistical learning theory relies on the assumption that data are identically and independently distributed (i.i.d.). However, this assumption often does not hold in many real-life applications. In this survey, we explore…

Machine Learning · Computer Science 2024-04-09 Rui-Ray Zhang, Massih-Reza Amini

Information-Theoretic Bounds and Task-Centric Learning Complexity for Real-World Dynamic Nonlinear Systems

Dynamic nonlinear systems exhibit distortions arising from coupled static and dynamic effects. Their intertwined nature poses major challenges for data-driven modeling. This paper presents a theoretical framework grounded in structured…

Machine Learning · Computer Science 2025-09-23 Sri Satish Krishna Chaitanya Bulusu, Mikko Sillanpää

Memorizing without overfitting: Bias, variance, and interpolation in over-parameterized models

The bias-variance trade-off is a central concept in supervised learning. In classical statistics, increasing the complexity of a model (e.g., number of parameters) reduces bias but also increases variance. Until recently, it was commonly…

Machine Learning · Statistics 2022-03-25 Jason W. Rocks, Pankaj Mehta

Effective Sample Size and Generalization Bounds for Temporal Networks

Learning from time series is fundamentally different from learning from i.i.d. data: temporal dependence can make long sequences effectively information-poor, yet standard evaluation protocols conflate sequence length with statistical…

Machine Learning · Computer Science 2026-03-05 Barak Gahtan, Alex M. Bronstein

Learning Whenever Learning is Possible: Universal Learning under General Stochastic Processes

This work initiates a general study of learning and generalization without the i.i.d. assumption, starting from first principles. While the traditional approach to statistical learning theory typically relies on standard assumptions from…

Machine Learning · Statistics 2020-10-21 Steve Hanneke

Concentration Inequalities for Suprema of Empirical Processes with Dependent Data via Generic Chaining with Applications to Statistical Learning

This paper develops a general concentration inequality for the suprema of empirical processes with dependent data. The concentration inequality is obtained by combining generic chaining with a coupling-based strategy. Our framework…

Econometrics · Economics 2026-02-23 Chiara Amorino, Christian Brownlees, Ankita Ghosh

An Information-Theoretic Approach to Generalization Theory

We investigate the in-distribution generalization of machine learning algorithms. We depart from traditional complexity-based approaches by analyzing information-theoretic bounds that quantify the dependence between a learning algorithm and…

Machine Learning · Statistics 2024-08-27 Borja Rodríguez-Gálvez, Ragnar Thobaben, Mikael Skoglund

Towards Data-Algorithm Dependent Generalization: a Case Study on Overparameterized Linear Regression

One of the major open problems in machine learning is to characterize generalization in the overparameterized regime, where most traditional generalization bounds become inconsistent even for overparameterized linear regression. In many…

Machine Learning · Computer Science 2023-11-22 Jing Xu, Jiaye Teng, Yang Yuan, Andrew Chi-Chih Yao

Measures of Dependence based on Wasserstein distances

Measuring dependence between random variables is a fundamental problem in Statistics, with applications across diverse fields. While classical measures such as Pearson's correlation have been widely used for over a century, they have…

Statistics Theory · Mathematics 2025-10-08 Marta Catalano, Hugo Lavenant

Statistical learning for $\psi$-weakly dependent processes

We consider the statistical learning question for $\psi$-weakly dependent processes, which unify a large class of weak dependence conditions such as mixing, association, $\cdots$ The consistency of the empirical risk minimization algorithm is…

Statistics Theory · Mathematics 2022-10-04 Mamadou Lamine Diop, William Kengne

McDiarmid-Type Inequalities for Graph-Dependent Variables and Stability Bounds

A crucial assumption in most statistical learning theory is that samples are independently and identically distributed (i.i.d.). However, for many real applications, the i.i.d. assumption does not hold. We consider learning problems in…

Machine Learning · Computer Science 2019-09-10 Rui Ray Zhang, Xingwu Liu, Yuyi Wang, Liwei Wang

Learning from weakly dependent data under Dobrushin's condition

Statistical learning theory has largely focused on learning and generalization given independent and identically distributed (i.i.d.) samples. Motivated by applications involving time-series data, there has been a growing literature on…

Machine Learning · Computer Science 2019-06-24 Yuval Dagan, Constantinos Daskalakis, Nishanth Dikkala, Siddhartha Jayanti

Dependency-dependent Bounds for Sums of Dependent Random Variables

We consider the problem of bounding large deviations for non-i.i.d. random variables that are allowed to have arbitrary dependencies. Previous works typically assumed a specific dependence structure, namely the existence of independent…

Probability · Mathematics 2018-11-06 Christoph H. Lampert, Liva Ralaivola, Alexander Zimin

A Novel Data-Dependent Learning Paradigm for Large Hypothesis Classes

We address the general task of learning with a set of candidate models that is too large to have a uniform convergence of empirical estimates to true losses. While the common approach to such challenges is SRM (or regularization) based…

Machine Learning · Computer Science 2025-11-14 Alireza F. Pour, Shai Ben-David

Towards Optimal Problem Dependent Generalization Error Bounds in Statistical Learning Theory

We study problem-dependent rates, i.e., generalization errors that scale near-optimally with the variance, the effective loss, or the gradient norms evaluated at the "best hypothesis." We introduce a principled framework dubbed "uniform…

Machine Learning · Statistics 2020-12-25 Yunbei Xu, Assaf Zeevi

Margin-Based Transfer Bounds for Meta Learning with Deep Feature Embedding

By transferring knowledge learned from seen/previous tasks, meta learning aims to generalize well to unseen/future tasks. Existing meta-learning approaches have shown promising empirical performance on various multiclass classification…

Machine Learning · Computer Science 2020-12-04 Jiechao Guan, Zhiwu Lu, Tao Xiang, Timothy Hospedales

Multivariate Species Sampling Models

Species sampling processes have long served as the fundamental framework for modeling random discrete distributions and exchangeable sequences. However, data arising from distinct but related sources require a broader notion of…

Statistics Theory · Mathematics 2026-02-03 Beatrice Franzolini, Antonio Lijoi, Igor Prünster, Giovanni Rebaudo

Information-Theoretic Generalization Bounds for Meta-Learning and Applications

Meta-learning, or "learning to learn", refers to techniques that infer an inductive bias from data corresponding to multiple related tasks with the goal of improving the sample efficiency for new, previously unobserved, tasks. A key…

Machine Learning · Computer Science 2021-02-24 Sharu Theresa Jose, Osvaldo Simeone