Related papers: Deep Learning using Linear Support Vector Machines

200 papers

This paper investigates the deep learning optimization problem with softmax cross-entropy loss. We propose a layer separation strategy to alleviate the strong nonconvexity encountered during training deep networks. For cross-entropy models…

Machine Learning · Computer Science 2026-04-28 Yaru Liu, Michael K. Ng, Yiqi Gu

Cross-entropy loss together with softmax is arguably one of the most commonly used supervision components in convolutional neural networks (CNNs). Despite its simplicity, popularity and excellent performance, the component does not explicitly…

Machine Learning · Statistics 2017-11-21 Weiyang Liu, Yandong Wen, Zhiding Yu, Meng Yang

Face recognition has witnessed significant progress due to the advances of deep convolutional neural networks (CNNs), the central challenge of which is feature discrimination. To address it, one group tries to exploit mining-based…

Computer Vision and Pattern Recognition · Computer Science 2019-01-01 Xiaobo Wang, Shuo Wang, Shifeng Zhang, Tianyu Fu, Hailin Shi, Tao Mei

Convolutional neural networks (CNNs) are similar to "ordinary" neural networks in the sense that they are made up of hidden layers consisting of neurons with "learnable" parameters. These neurons receive inputs, perform a dot product, and…

Computer Vision and Pattern Recognition · Computer Science 2019-02-08 Abien Fred Agarap

With the development of deep learning, Deep Metric Learning (DML) has achieved great improvements in face recognition. Specifically, the widely used softmax loss in the training process often brings large intra-class variations, and feature…

Computer Vision and Pattern Recognition · Computer Science 2018-05-02 Bowen Wu, Huaming Wu, Monica M. Y. Zhang

The softmax-based loss functions and their variants (e.g., cosface, sphereface, and arcface) significantly improve face recognition performance in unconstrained, in-the-wild scenes. A common practice of these algorithms is to perform…

Computer Vision and Pattern Recognition · Computer Science 2022-08-25 Hongwei Xu, Suncheng Xiang, Dahong Qian

The support vector machine (SVM) has attracted great attention over the last two decades due to its extensive applications, and thus numerous optimization models have been proposed. To distinguish all of them, in this paper, we introduce a new…

Optimization and Control · Mathematics 2021-04-06 Huajun Wang, Yuanhai Shao, Shenglong Zhou, Ce Zhang, Naihua Xiu

In recent years, the performance of face verification systems has significantly improved using deep convolutional neural networks (DCNNs). A typical pipeline for face verification includes training a deep network for subject classification…

Computer Vision and Pattern Recognition · Computer Science 2017-06-09 Rajeev Ranjan, Carlos D. Castillo, Rama Chellappa

Face recognition has witnessed significant progress due to the advances of deep convolutional neural networks (CNNs), the central task of which is how to improve the feature discrimination. To this end, several margin-based (e.g.,…

Computer Vision and Pattern Recognition · Computer Science 2019-12-03 Xiaobo Wang, Shifeng Zhang, Shuo Wang, Tianyu Fu, Hailin Shi, Tao Mei

In this work, we investigate several methods and strategies to learn deep embeddings for face recognition, using joint sample- and set-based optimization. We explain our framework that expands traditional learning with set-based supervision…

Computer Vision and Pattern Recognition · Computer Science 2020-09-09 Baris Gecer, Vassileios Balntas, Tae-Kyun Kim

Face recognition is one of the most widely publicized features in devices today and hence represents an important problem that should be studied with the utmost priority. As per the recent trends, the Convolutional Neural Network (CNN)…

Computer Vision and Pattern Recognition · Computer Science 2019-11-07 Yash Srivastava, Vaishnav Murali, Shiv Ram Dubey

We propose a kernelized classification layer for deep networks. Although conventional deep networks introduce an abundance of nonlinearity for representation (feature) learning, they almost universally use a linear classifier on the learned…

Machine Learning · Computer Science 2021-03-22 Sadeep Jayasumana, Srikumar Ramalingam, Sanjiv Kumar

We introduce LL-RNNs (Log-Linear RNNs), an extension of Recurrent Neural Networks that replaces the softmax output layer by a log-linear output layer, of which the softmax is a special case. This conceptually simple move has two main…

Artificial Intelligence · Computer Science 2016-12-19 Marc Dymetman, Chunyang Xiao

Loss functions play a key role in training superior deep neural networks. In convolutional neural networks (CNNs), the popular cross entropy loss together with softmax does not explicitly guarantee minimization of intra-class variance or…

Computer Vision and Pattern Recognition · Computer Science 2019-04-26 XiaoBin Li, WeiQiang Wang

To encourage intra-class compactness and inter-class separability among trainable feature vectors, large-margin softmax methods are developed and widely applied in the face recognition community. The introduction of the large-margin concept…

Audio and Speech Processing · Electrical Eng. & Systems 2021-04-22 Jingjing Huo, Yingbo Gao, Weiyue Wang, Ralf Schlüter, Hermann Ney

A common practice in most deep convolutional neural architectures is to employ fully-connected layers followed by Softmax activation to minimize cross-entropy loss for the sake of classification. Recent studies show that substitution or…

Machine Learning · Computer Science 2017-10-23 Arash Shahriari

The learning objective plays a fundamental role in building a recommender system. Most methods routinely adopt either pointwise or pairwise loss to train the model parameters, while rarely paying attention to softmax loss due to its computational…

Information Retrieval · Computer Science 2023-12-20 Jiancan Wu, Xiang Wang, Xingyu Gao, Jiawei Chen, Hongcheng Fu, Tianyu Qiu

In recent years, the softmax model and its fast approximations have become the de-facto loss functions for deep neural networks when dealing with multi-class prediction. This loss has been extended to language modeling and recommendation,…

Machine Learning · Statistics 2019-09-19 Ugo Tanielian, Flavian Vasile

The Softmax function is used in the final layer of nearly all existing sequence-to-sequence models for language generation. However, it is usually the slowest layer to compute, which limits the vocabulary size to a subset of most frequent…

Computation and Language · Computer Science 2019-03-25 Sachin Kumar, Yulia Tsvetkov

Learning the discriminative features of different faces is an important task in face recognition. By extracting face features in neural networks, it becomes easy to measure the similarity of different face images, which makes face…

Computer Vision and Pattern Recognition · Computer Science 2023-12-20 Jiamu Xu, Xiaoxiang Liu, Xinyuan Zhang, Yain-Whar Si, Xiaofan Li, Zheng Shi, Ke Wang, Xueyuan Gong