Related papers: Large scale analysis of generalization error in le…

200 papers

Margin-based classifiers have been popular in both machine learning and statistics for classification problems. Since a large number of classifiers are available, one natural question is which type of classifiers should be used given a…

Machine Learning · Statistics 2021-10-19 Hanwen Huang, Qinglong Yang

We consider the classification problem of a high-dimensional mixture of two Gaussians with general covariance matrices. Using the replica method from statistical physics, we investigate the asymptotic behavior of a general class of…

Machine Learning · Statistics 2024-10-29 Hanwen Huang, Peng Zeng

Modern machine learning classifiers often exhibit vanishing classification error on the training set. They achieve this by learning nonlinear representations of the inputs that map the data into linearly separable classes. Motivated by…

Statistics Theory · Mathematics 2023-03-23 Andrea Montanari, Feng Ruan, Youngtak Sohn, Jun Yan

In this article, we study rates of convergence of the generalization error of multi-class margin classifiers. In particular, we develop an upper bound theory quantifying the generalization error of various large margin classifiers. The…

Statistics Theory · Mathematics 2011-11-10 Xiaotong Shen, Lifeng Wang

Maximum margin binary classification is one of the most fundamental algorithms in machine learning, yet the role of featurization maps and the high-dimensional asymptotics of the misclassification error for non-Gaussian features are still…

Statistics Theory · Mathematics 2023-10-03 Andrea Montanari, Feng Ruan, Basil Saeed, Youngtak Sohn

We study generalised linear regression and classification for a synthetically generated dataset encompassing different problems of interest, such as learning with random features, neural networks in the lazy training regime, and the hidden…

Statistics Theory · Mathematics 2022-03-28 Federica Gerace, Bruno Loureiro, Florent Krzakala, Marc Mézard, Lenka Zdeborová

We obtain a tight distribution-specific characterization of the sample complexity of large-margin classification with L2 regularization: We introduce the margin-adapted dimension, which is a simple function of the second order statistics of…

Machine Learning · Statistics 2013-09-19 Sivan Sabato, Nathan Srebro, Naftali Tishby

We present a formulation of deep learning that aims at producing a large margin classifier. The notion of margin, minimum distance to a decision boundary, has served as the foundation of several theoretically profound and empirically…

Machine Learning · Statistics 2018-12-05 Gamaleldin F. Elsayed, Dilip Krishnan, Hossein Mobahi, Kevin Regan, Samy Bengio

At the heart of machine learning lies the question of generalizability of learned rules over previously unseen data. While over-parameterized models based on neural networks are now ubiquitous in machine learning applications, our…

Machine Learning · Computer Science 2020-05-04 Melikasadat Emami, Mojtaba Sahraee-Ardakan, Parthe Pandit, Sundeep Rangan, Alyson K. Fletcher

We consider a model for logistic regression where only a subset of features of size $p$ is used for training a linear classifier over $n$ training samples. The classifier is obtained by running gradient descent (GD) on logistic loss. For…

Machine Learning · Statistics 2020-05-12 Zeyu Deng, Abla Kammoun, Christos Thrampoulidis

This article carries out a large dimensional analysis of standard regularized discriminant analysis classifiers designed on the assumption that data arise from a Gaussian mixture model with different means and covariances. The analysis…

Machine Learning · Statistics 2019-06-19 Khalil Elkhalil, Abla Kammoun, Romain Couillet, Tareq Y. Al-Naffouri, Mohamed-Slim Alouini

For a large class of feature maps we provide a tight asymptotic characterisation of the test error associated with learning the readout layer, in the high-dimensional limit where the input dimension, hidden layer widths, and number of…

Machine Learning · Statistics 2024-06-11 Dominik Schröder, Daniil Dmitriev, Hugo Cui, Bruno Loureiro

For linear classifiers, the relationship between (normalized) output margin and generalization is captured in a clear and simple bound -- a large output margin implies good generalization. Unfortunately, for deep models, this relationship…

Machine Learning · Computer Science 2021-06-17 Colin Wei, Tengyu Ma

We obtain a tight distribution-specific characterization of the sample complexity of large-margin classification with $L_2$ regularization: We introduce the $\gamma$-adapted-dimension, which is a simple function of the spectrum of a…

Machine Learning · Computer Science 2015-03-17 Sivan Sabato, Nathan Srebro, Naftali Tishby

We study in this paper lower bounds for the generalization error of models derived from multi-layer neural networks, in the regime where the size of the layers is commensurate with the number of samples in the training data. We show that…

Machine Learning · Statistics 2022-07-08 Inbar Seroussi, Ofer Zeitouni

As shown in recent research, deep neural networks can perfectly fit randomly labeled data, but with very poor accuracy on held-out data. This phenomenon indicates that loss functions such as cross-entropy are not a reliable indicator of…

Machine Learning · Statistics 2019-06-13 Yiding Jiang, Dilip Krishnan, Hossein Mobahi, Samy Bengio

The generalization error of deep neural networks via their classification margin is studied in this work. Our approach is based on the Jacobian matrix of a deep neural network and can be applied to networks with arbitrary non-linearities…

Machine Learning · Statistics 2017-07-04 Jure Sokolic, Raja Giryes, Guillermo Sapiro, Miguel R. D. Rodrigues

We study the distributional properties of the linear discriminant function under the assumption of normality by comparing two groups with the same covariance matrix but different mean vectors. A stochastic representation for the…

Statistics Theory · Mathematics 2017-05-09 Taras Bodnar, Stepan Mazur, Edward Ngailo, Nestor Parolya

Generalised linear models for multi-class classification problems are one of the fundamental building blocks of modern machine learning tasks. In this manuscript, we characterise the learning of a mixture of $K$ Gaussians with generic means…

We study general singular value shrinkage estimators in high-dimensional regression and classification, when the number of features and the sample size both grow proportionally to infinity. We allow models with general covariance matrices…

Statistics Theory · Mathematics 2020-04-01 Panagiotis Lolas