Related papers: Large dimensional analysis of general margin based…

200 papers

Large-margin classifiers are popular methods for classification. We derive the asymptotic expression for the generalization error of a family of large-margin classifiers in the limit of both sample size $n$ and dimension $p$ going to…

Machine Learning · Statistics 2020-12-02 Hanwen Huang, Qinglong Yang

Modern machine learning classifiers often exhibit vanishing classification error on the training set. They achieve this by learning nonlinear representations of the inputs that map the data into linearly separable classes. Motivated by…

Statistics Theory · Mathematics 2023-03-23 Andrea Montanari, Feng Ruan, Youngtak Sohn, Jun Yan

Logistic models are commonly used for binary classification tasks. The success of such models has often been attributed to their connection to maximum-likelihood estimators. It has been shown that the gradient descent algorithm, when applied to…

Machine Learning · Statistics 2020-10-30 Fariborz Salehi, Ehsan Abbasi, Babak Hassibi

Maximum margin binary classification is one of the most fundamental algorithms in machine learning, yet the role of featurization maps and the high-dimensional asymptotics of the misclassification error for non-Gaussian features are still…

Statistics Theory · Mathematics 2023-10-03 Andrea Montanari, Feng Ruan, Basil Saeed, Youngtak Sohn

This paper investigates the asymptotic behavior of the soft-margin and hard-margin support vector machine (SVM) classifiers for simultaneously high-dimensional and numerous data (large $n$ and large $p$ with $n/p\to\delta$) drawn from a…

Information Theory · Computer Science 2020-03-31 Abla Kammoun, Mohamed-Slim Alouini

The objective of the paper is to study the accuracy of multi-class classification in a high-dimensional setting, where the number of classes is also large ("large $L$, large $p$, small $n$" model). While this problem arises in many practical…

Statistics Theory · Mathematics 2019-07-18 Felix Abramovich, Marianna Pensky

This article carries out a large dimensional analysis of standard regularized discriminant analysis classifiers designed on the assumption that data arise from a Gaussian mixture model with different means and covariances. The analysis…

Machine Learning · Statistics 2019-06-19 Khalil Elkhalil, Abla Kammoun, Romain Couillet, Tareq Y. Al-Naffouri, Mohamed-Slim Alouini

This paper serves as a survey of recent advances in large margin training and its theoretical foundations, mostly for (nonlinear) deep neural networks (DNNs) that are probably the most prominent machine learning models for large-scale data…

Machine Learning · Computer Science 2021-06-22 Yiwen Guo, Changshui Zhang

Numbers and numerical vectors account for a large portion of data. However, recently the amount of string data generated has increased dramatically. Consequently, classifying string data is a common problem in many fields. The most widely…

Machine Learning · Statistics 2016-02-24 Hitoshi Koyano, Morihiro Hayashida, Tatsuya Akutsu

In recent years, pattern analysis has played an important role in data mining and recognition, and many variants have been proposed to handle complicated scenarios. The literature is by now quite familiar with the high dimensionality of data…

Machine Learning · Computer Science 2018-11-09 Miao Cheng, Zunren Liu, Hongwei Zou, Ah Chung Tsoi

In many real-world applications, data is not collected as one batch, but sequentially over time, and often it is not possible or desirable to wait until the data is completely gathered before analyzing it. Thus, we propose a framework to…

Machine Learning · Statistics 2018-03-09 Elizabeth Hou, Alfred O. Hero

We obtain a tight distribution-specific characterization of the sample complexity of large-margin classification with L2 regularization: We introduce the margin-adapted dimension, which is a simple function of the second order statistics of…

Machine Learning · Statistics 2013-09-19 Sivan Sabato, Nathan Srebro, Naftali Tishby

Datasets containing both categorical and continuous variables are frequently encountered in many areas, and with the rapid development of modern measurement technologies, the dimensions of these variables can be very high. Despite the…

Methodology · Statistics 2024-01-03 Binyan Jiang, Chenlei Leng, Cheng Wang, Zhongqing Yang, Xinyang Yu

Integrating the outputs of multiple classifiers via combiners or meta-learners has led to substantial improvements in several difficult pattern recognition problems. In the typical setting investigated till now, each classifier is trained…

Machine Learning · Computer Science 2007-05-23 Kagan Tumer, Joydeep Ghosh

Various applications in different fields, such as gene expression analysis or computer vision, involve high-dimensional low-sample-size (HDLSS) data sets, which pose significant challenges for standard statistical and modern…

Machine Learning · Computer Science 2021-01-27 Qingbo Yin, Ehsan Adeli, Liran Shen, Dinggang Shen

Datasets from the fields of bioinformatics, chemometrics, and face recognition are typically characterized by small samples of high-dimensional data. Among the many variants of linear discriminant analysis that have been proposed in order…

Machine Learning · Statistics 2020-04-20 Lama B. Niyazi, Abla Kammoun, Hayssam Dahrouj, Mohamed-Slim Alouini, Tareq Y. Al-Naffouri

We consider the classification problem of a high-dimensional mixture of two Gaussians with general covariance matrices. Using the replica method from statistical physics, we investigate the asymptotic behavior of a general class of…

Machine Learning · Statistics 2024-10-29 Hanwen Huang, Peng Zeng

We present a formulation of deep learning that aims at producing a large margin classifier. The notion of margin, minimum distance to a decision boundary, has served as the foundation of several theoretically profound and empirically…

Machine Learning · Statistics 2018-12-05 Gamaleldin F. Elsayed, Dilip Krishnan, Hossein Mobahi, Kevin Regan, Samy Bengio

Classification is an important topic in statistics and machine learning with great potential in many real applications. In this paper, we investigate two popular large margin classification methods, Support Vector Machine (SVM) and Distance…

Machine Learning · Statistics 2013-10-14 Xingye Qiao, Lingsong Zhang

The support vector machine (SVM) has been one of the most popular learning algorithms, with the central idea of maximizing the minimum margin, i.e., the smallest distance from the instances to the classification boundary. Recent theoretical…

Machine Learning · Computer Science 2014-05-26 Teng Zhang, Zhi-Hua Zhou