Related papers: Deep Learning using Linear Support Vector Machines

A Layer Separation Optimization Framework for Cross-Entropy Training in Deep Learning

This paper investigates the deep learning optimization problem with softmax cross-entropy loss. We propose a layer separation strategy to alleviate the strong nonconvexity encountered during training deep networks. For cross-entropy models…

Machine Learning · Computer Science 2026-04-28 Yaru Liu , Michael K. Ng , Yiqi Gu

Large-Margin Softmax Loss for Convolutional Neural Networks

Cross-entropy loss together with softmax is arguably one of the most common used supervision components in convolutional neural networks (CNNs). Despite its simplicity, popularity and excellent performance, the component does not explicitly…

Machine Learning · Statistics 2017-11-21 Weiyang Liu , Yandong Wen , Zhiding Yu , Meng Yang

Support Vector Guided Softmax Loss for Face Recognition

Face recognition has witnessed significant progresses due to the advances of deep convolutional neural networks (CNNs), the central challenge of which, is feature discrimination. To address it, one group tries to exploit mining-based…

Computer Vision and Pattern Recognition · Computer Science 2019-01-01 Xiaobo Wang , Shuo Wang , Shifeng Zhang , Tianyu Fu , Hailin Shi , Tao Mei

An Architecture Combining Convolutional Neural Network (CNN) and Support Vector Machine (SVM) for Image Classification

Convolutional neural networks (CNNs) are similar to "ordinary" neural networks in the sense that they are made up of hidden layers consisting of neurons with "learnable" parameters. These neurons receive inputs, performs a dot product, and…

Computer Vision and Pattern Recognition · Computer Science 2019-02-08 Abien Fred Agarap

Scalable Angular Discriminative Deep Metric Learning for Face Recognition

With the development of deep learning, Deep Metric Learning (DML) has achieved great improvements in face recognition. Specifically, the widely used softmax loss in the training process often bring large intra-class variations, and feature…

Computer Vision and Pattern Recognition · Computer Science 2018-05-02 Bowen Wu , Huaming Wu , Monica M. Y. Zhang

SubFace: Learning with Softmax Approximation for Face Recognition

The softmax-based loss functions and its variants (e.g., cosface, sphereface, and arcface) significantly improve the face recognition performance in wild unconstrained scenes. A common practice of these algorithms is to perform…

Computer Vision and Pattern Recognition · Computer Science 2022-08-25 Hongwei Xu , Suncheng Xiang , Dahong Qian

Support Vector Machine Classifier via $L_{0/1}$ Soft-Margin Loss

Support vector machine (SVM) has attracted great attentions for the last two decades due to its extensive applications, and thus numerous optimization models have been proposed. To distinguish all of them, in this paper, we introduce a new…

Optimization and Control · Mathematics 2021-04-06 Huajun Wang , Yuanhai Shao , Shenglong Zhou , Ce Zhang , Naihua Xiu

L2-constrained Softmax Loss for Discriminative Face Verification

In recent years, the performance of face verification systems has significantly improved using deep convolutional neural networks (DCNNs). A typical pipeline for face verification includes training a deep network for subject classification…

Computer Vision and Pattern Recognition · Computer Science 2017-06-09 Rajeev Ranjan , Carlos D. Castillo , Rama Chellappa

Mis-classified Vector Guided Softmax Loss for Face Recognition

Face recognition has witnessed significant progress due to the advances of deep convolutional neural networks (CNNs), the central task of which is how to improve the feature discrimination. To this end, several margin-based (\textit{e.g.},…

Computer Vision and Pattern Recognition · Computer Science 2019-12-03 Xiaobo Wang , Shifeng Zhang , Shuo Wang , Tianyu Fu , Hailin Shi , Tao Mei

Learning Deep Convolutional Embeddings for Face Representation Using Joint Sample- and Set-based Supervision

In this work, we investigate several methods and strategies to learn deep embeddings for face recognition, using joint sample- and set-based optimization. We explain our framework that expands traditional learning with set-based supervision…

Computer Vision and Pattern Recognition · Computer Science 2020-09-09 Baris Gecer , Vassileios Balntas , Tae-Kyun Kim

A Performance Comparison of Loss Functions for Deep Face Recognition

Face recognition is one of the most widely publicized feature in the devices today and hence represents an important problem that should be studied with the utmost priority. As per the recent trends, the Convolutional Neural Network (CNN)…

Computer Vision and Pattern Recognition · Computer Science 2019-11-07 Yash Srivastava , Vaishnav Murali , Shiv Ram Dubey

Kernelized Classification in Deep Networks

We propose a kernelized classification layer for deep networks. Although conventional deep networks introduce an abundance of nonlinearity for representation (feature) learning, they almost universally use a linear classifier on the learned…

Machine Learning · Computer Science 2021-03-22 Sadeep Jayasumana , Srikumar Ramalingam , Sanjiv Kumar

Log-Linear RNNs: Towards Recurrent Neural Networks with Flexible Prior Knowledge

We introduce LL-RNNs (Log-Linear RNNs), an extension of Recurrent Neural Networks that replaces the softmax output layer by a log-linear output layer, of which the softmax is a special case. This conceptually simple move has two main…

Artificial Intelligence · Computer Science 2016-12-19 Marc Dymetman , Chunyang Xiao

Learning Discriminative Features Via Weights-biased Softmax Loss

Loss functions play a key role in training superior deep neural networks. In convolutional neural networks (CNNs), the popular cross entropy loss together with softmax does not explicitly guarantee minimization of intra-class variance or…

Computer Vision and Pattern Recognition · Computer Science 2019-04-26 XiaoBin Li , WeiQiang Wang

Investigation of Large-Margin Softmax in Neural Language Modeling

To encourage intra-class compactness and inter-class separability among trainable feature vectors, large-margin softmax methods are developed and widely applied in the face recognition community. The introduction of the large-margin concept…

Audio and Speech Processing · Electrical Eng. & Systems 2021-04-22 Jingjing Huo , Yingbo Gao , Weiyue Wang , Ralf Schlüter , Hermann Ney

Unified Backpropagation for Multi-Objective Deep Learning

A common practice in most of deep convolutional neural architectures is to employ fully-connected layers followed by Softmax activation to minimize cross-entropy loss for the sake of classification. Recent studies show that substitution or…

Machine Learning · Computer Science 2017-10-23 Arash Shahriari

On the Effectiveness of Sampled Softmax Loss for Item Recommendation

The learning objective plays a fundamental role to build a recommender system. Most methods routinely adopt either pointwise or pairwise loss to train the model parameters, while rarely pay attention to softmax loss due to its computational…

Information Retrieval · Computer Science 2023-12-20 Jiancan Wu , Xiang Wang , Xingyu Gao , Jiawei Chen , Hongcheng Fu , Tianyu Qiu

Relaxed Softmax for learning from Positive and Unlabeled data

In recent years, the softmax model and its fast approximations have become the de-facto loss functions for deep neural networks when dealing with multi-class prediction. This loss has been extended to language modeling and recommendation,…

Machine Learning · Statistics 2019-09-19 Ugo Tanielian , Flavian Vasile

Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs

The Softmax function is used in the final layer of nearly all existing sequence-to-sequence models for language generation. However, it is usually the slowest layer to compute which limits the vocabulary size to a subset of most frequent…

Computation and Language · Computer Science 2019-03-25 Sachin Kumar , Yulia Tsvetkov

X2-Softmax: Margin Adaptive Loss Function for Face Recognition

Learning the discriminative features of different faces is an important task in face recognition. By extracting face features in neural networks, it becomes easy to measure the similarity of different face images, which makes face…

Computer Vision and Pattern Recognition · Computer Science 2023-12-20 Jiamu Xu , Xiaoxiang Liu , Xinyuan Zhang , Yain-Whar Si , Xiaofan Li , Zheng Shi , Ke Wang , Xueyuan Gong