English
Related papers

Related papers: Improving Neural Network Classifier using Gradient…

200 papers

Handling varying computational resources is a critical issue in modern AI applications. Adaptive deep networks, featuring the dynamic employment of multiple classifier heads among different layers, have been proposed to address…

Computer Vision and Pattern Recognition · Computer Science 2024-08-30 Xu Zhang , Zhipeng Xie , Haiyang Yu , Qitong Wang , Peng Wang , Wei Wang

Optimization techniques are of great importance to effectively and efficiently train a deep neural network (DNN). It has been shown that using the first and second order statistics (e.g., mean and variance) to perform Z-score…

Computer Vision and Pattern Recognition · Computer Science 2020-04-09 Hongwei Yong , Jianqiang Huang , Xiansheng Hua , Lei Zhang

Recently, a multi-level fuzzy min max neural network (MLF) was proposed, which improves the classification accuracy by handling an overlapped region (area of confusion) with the help of a tree structure. In this brief, an extension of MLF…

Artificial Intelligence · Computer Science 2016-12-21 Shraddha Deshmukh , Sagar Gandhi , Pratap Sanap , Vivek Kulkarni

Training deep neural networks remains computationally intensive due to the itera2 tive nature of gradient-based optimization. We propose Gradient Flow Matching (GFM), a continuous-time modeling framework that treats neural network training…

Machine Learning · Computer Science 2025-05-27 Xiao Shou , Yanna Ding , Jianxi Gao

Catastrophic forgetting (CF) occurs when a neural network loses the information previously learned while training on a set of samples from a different distribution, i.e., a new task. Existing approaches have achieved remarkable results in…

Machine Learning · Computer Science 2022-09-13 Jary Pomponi , Simone Scardapane , Aurelio Uncini

A classification algorithm, called the Linear Centralization Classifier (LCC), is introduced. The algorithm seeks to find a transformation that best maps instances from the feature space to a space where they concentrate towards the center…

Machine Learning · Computer Science 2017-12-25 Mohammad Reza Bonyadi , Viktor Vegh , David C. Reutens

Deep neural networks (DNN) are typically optimized using stochastic gradient descent (SGD). However, the estimation of the gradient using stochastic samples tends to be noisy and unreliable, resulting in large gradient variance and bad…

Machine Learning · Computer Science 2021-05-18 Xingyi Yang

The simplicity of gradient descent (GD) made it the default method for training ever-deeper and complex neural networks. Both loss functions and architectures are often explicitly tuned to be amenable to this basic local optimization. In…

Machine Learning · Computer Science 2019-04-30 Dmitrii Marin , Meng Tang , Ismail Ben Ayed , Yuri Boykov

This paper introduces the centroid decision forest (CDF), a novel ensemble learning framework that redefines the splitting strategy and tree building in the ordinary decision trees for high-dimensional classification. The splitting approach…

Machine Learning · Statistics 2026-01-09 Amjad Ali , Saeed Aldahmani , Hailiang Du , Zardad Khan

We consider decentralized gradient-free optimization of minimizing Lipschitz continuous functions that satisfy neither smoothness nor convexity assumption. We propose two novel gradient-free algorithms, the Decentralized Gradient-Free…

Optimization and Control · Mathematics 2025-01-29 Zhenwei Lin , Jingfan Xia , Qi Deng , Luo Luo

Federated learning (FL) is an emerging machine learning paradigm that allows to accomplish model training without aggregating data at a central server. Most studies on FL consider a centralized framework, in which a single server is endowed…

Machine Learning · Computer Science 2023-03-22 Bin Wang , Jun Fang , Hongbin Li , Xiaojun Yuan , Qing Ling

We propose Sequential Feature Filtering Classifier (FFC), a simple but effective classifier for convolutional neural networks (CNNs). With sequential LayerNorm and ReLU, FFC zeroes out low-activation units and preserves high-activation…

Computer Vision and Pattern Recognition · Computer Science 2020-06-23 Minseok Seo , Jaemin Lee , Jongchan Park , Dong-Geol Choi

The primal-dual method of multipliers (PDMM) was originally designed for solving a decomposable optimisation problem over a general network. In this paper, we revisit PDMM for optimisation over a centralized network. We first note that the…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-07-21 Guoqiang Zhang , Kenta Niwa , W. Bastiaan Kleijn

As deep learning continues to advance, the transparency of neural network decision-making remains a critical challenge, limiting trust and applicability in high-stakes domains. Class Activation Mapping (CAM) techniques have emerged as a key…

Machine Learning · Computer Science 2025-04-24 Hongjie He , Xu Pan , Yudong Yao

Online Continual Learning (OCL) for image classification represents a challenging subset of Continual Learning, focusing on classifying images from a stream without assuming data independence and identical distribution (i.i.d). The primary…

Machine Learning · Computer Science 2026-03-24 Joe Khawand , David Colliaux

Natural gradient descent (NGD) provided deep insights and powerful tools to deep neural networks. However the computation of Fisher information matrix becomes more and more difficult as the network structure turns large and complex. This…

Machine Learning · Computer Science 2021-09-22 Weihua Liu , Xiabi Liu

Stochastic Gradient Decent (SGD) is one of the core techniques behind the success of deep neural networks. The gradient provides information on the direction in which a function has the steepest rate of change. The main problem with basic…

With increasing appealing to privacy issues in face recognition, federated learning has emerged as one of the most prevalent approaches to study the unconstrained face recognition problem with private decentralized data. However,…

Computer Vision and Pattern Recognition · Computer Science 2021-12-15 Yifan Niu , Weihong Deng

A fundamental issue for federated learning (FL) is how to achieve optimal model performance under highly dynamic communication environments. This issue can be alleviated by the fact that modern edge devices usually can connect to the edge…

Machine Learning · Computer Science 2021-09-21 Haizhou Du , Xiaojie Feng , Qiao Xiang , Haoyu Liu

Computational Fluid Dynamics (CFD) simulation by the numerical solution of the Navier-Stokes equations is an essential tool in a wide range of applications from engineering design to climate modeling. However, the computational cost and…

Computational Physics · Physics 2021-11-29 Mateus Dias Ribeiro , Abdul Rehman , Sheraz Ahmed , Andreas Dengel
‹ Prev 1 2 3 10 Next ›