Related papers: Large scale analysis of generalization error in le…

Large dimensional analysis of general margin based classification methods

Margin-based classifiers have been popular in both machine learning and statistics for classification problems. Since a large number of classifiers are available, one natural question is which type of classifiers should be used given a…

Machine Learning · Statistics 2021-10-19 Hanwen Huang, Qinglong Yang

Statistical Inference in Classification of High-dimensional Gaussian Mixture

We consider the classification problem of a high-dimensional mixture of two Gaussians with general covariance matrices. Using the replica method from statistical physics, we investigate the asymptotic behavior of a general class of…

Machine Learning · Statistics 2024-10-29 Hanwen Huang, Peng Zeng

The generalization error of max-margin linear classifiers: Benign overfitting and high dimensional asymptotics in the overparametrized regime

Modern machine learning classifiers often exhibit vanishing classification error on the training set. They achieve this by learning nonlinear representations of the inputs that map the data into linearly separable classes. Motivated by…

Statistics Theory · Mathematics 2023-03-23 Andrea Montanari, Feng Ruan, Youngtak Sohn, Jun Yan

Generalization error for multi-class margin classification

In this article, we study rates of convergence of the generalization error of multi-class margin classifiers. In particular, we develop an upper bound theory quantifying the generalization error of various large margin classifiers. The…

Statistics Theory · Mathematics 2011-11-10 Xiaotong Shen, Lifeng Wang

Universality of max-margin classifiers

Maximum margin binary classification is one of the most fundamental algorithms in machine learning, yet the role of featurization maps and the high-dimensional asymptotics of the misclassification error for non-Gaussian features are still…

Statistics Theory · Mathematics 2023-10-03 Andrea Montanari, Feng Ruan, Basil Saeed, Youngtak Sohn

Generalisation error in learning with random features and the hidden manifold model

We study generalised linear regression and classification for a synthetically generated dataset encompassing different problems of interest, such as learning with random features, neural networks in the lazy training regime, and the hidden…

Statistics Theory · Mathematics 2022-03-28 Federica Gerace, Bruno Loureiro, Florent Krzakala, Marc Mézard, Lenka Zdeborová

Distribution-Dependent Sample Complexity of Large Margin Learning

We obtain a tight distribution-specific characterization of the sample complexity of large-margin classification with L2 regularization: We introduce the margin-adapted dimension, which is a simple function of the second order statistics of…

Machine Learning · Statistics 2013-09-19 Sivan Sabato, Nathan Srebro, Naftali Tishby

Large Margin Deep Networks for Classification

We present a formulation of deep learning that aims at producing a large margin classifier. The notion of margin, minimum distance to a decision boundary, has served as the foundation of several theoretically profound and empirically…

Machine Learning · Statistics 2018-12-05 Gamaleldin F. Elsayed, Dilip Krishnan, Hossein Mobahi, Kevin Regan, Samy Bengio

Generalization Error of Generalized Linear Models in High Dimensions

At the heart of machine learning lies the question of generalizability of learned rules over previously unseen data. While over-parameterized models based on neural networks are now ubiquitous in machine learning applications, our…

Machine Learning · Computer Science 2020-05-04 Melikasadat Emami, Mojtaba Sahraee-Ardakan, Parthe Pandit, Sundeep Rangan, Alyson K. Fletcher

A Model of Double Descent for High-dimensional Binary Linear Classification

We consider a model for logistic regression where only a subset of features of size $p$ is used for training a linear classifier over $n$ training samples. The classifier is obtained by running gradient descent (GD) on logistic loss. For…

Machine Learning · Statistics 2020-05-12 Zeyu Deng, Abla Kammoun, Christos Thrampoulidis

A Large Dimensional Study of Regularized Discriminant Analysis Classifiers

This article carries out a large dimensional analysis of standard regularized discriminant analysis classifiers designed on the assumption that data arise from a Gaussian mixture model with different means and covariances. The analysis…

Machine Learning · Statistics 2019-06-19 Khalil Elkhalil, Abla Kammoun, Romain Couillet, Tareq Y. Al-Naffouri, Mohamed-Slim Alouini

Asymptotics of Learning with Deep Structured (Random) Features

For a large class of feature maps we provide a tight asymptotic characterisation of the test error associated with learning the readout layer, in the high-dimensional limit where the input dimension, hidden layer widths, and number of…

Machine Learning · Statistics 2024-06-11 Dominik Schröder, Daniil Dmitriev, Hugo Cui, Bruno Loureiro

Improved Sample Complexities for Deep Networks and Robust Classification via an All-Layer Margin

For linear classifiers, the relationship between (normalized) output margin and generalization is captured in a clear and simple bound -- a large output margin implies good generalization. Unfortunately, for deep models, this relationship…

Machine Learning · Computer Science 2021-06-17 Colin Wei, Tengyu Ma

Tight Sample Complexity of Large-Margin Learning

We obtain a tight distribution-specific characterization of the sample complexity of large-margin classification with $L_2$ regularization: We introduce the $\gamma$-adapted-dimension, which is a simple function of the spectrum of a…

Machine Learning · Computer Science 2015-03-17 Sivan Sabato, Nathan Srebro, Naftali Tishby

Lower Bounds on the Generalization Error of Nonlinear Learning Models

We study in this paper lower bounds for the generalization error of models derived from multi-layer neural networks, in the regime where the size of the layers is commensurate with the number of samples in the training data. We show that…

Machine Learning · Statistics 2022-07-08 Inbar Seroussi, Ofer Zeitouni

Predicting the Generalization Gap in Deep Networks with Margin Distributions

As shown in recent research, deep neural networks can perfectly fit randomly labeled data, but with very poor accuracy on held out data. This phenomenon indicates that loss functions such as cross-entropy are not a reliable indicator of…

Machine Learning · Statistics 2019-06-13 Yiding Jiang, Dilip Krishnan, Hossein Mobahi, Samy Bengio

Robust Large Margin Deep Neural Networks

The generalization error of deep neural networks via their classification margin is studied in this work. Our approach is based on the Jacobian matrix of a deep neural network and can be applied to networks with arbitrary non-linearities…

Machine Learning · Statistics 2017-07-04 Jure Sokolic, Raja Giryes, Guillermo Sapiro, Miguel R. D. Rodrigues

Discriminant analysis in small and large dimensions

We study the distributional properties of the linear discriminant function under the assumption of normality by comparing two groups with the same covariance matrix but different mean vectors. A stochastic representation for the…

Statistics Theory · Mathematics 2017-05-09 Taras Bodnar, Stepan Mazur, Edward Ngailo, Nestor Parolya

Learning Gaussian Mixtures with Generalised Linear Models: Precise Asymptotics in High-dimensions

Generalised linear models for multi-class classification problems are one of the fundamental building blocks of modern machine learning tasks. In this manuscript, we characterise the learning of a mixture of $K$ Gaussians with generic means…

Machine Learning · Statistics 2023-06-06 Bruno Loureiro, Gabriele Sicuro, Cédric Gerbelot, Alessandro Pacco, Florent Krzakala, Lenka Zdeborová

Regularization in High-Dimensional Regression and Classification via Random Matrix Theory

We study general singular value shrinkage estimators in high-dimensional regression and classification, when the number of features and the sample size both grow proportionally to infinity. We allow models with general covariance matrices…

Statistics Theory · Mathematics 2020-04-01 Panagiotis Lolas