Related papers: Improving Neural Network Classifier using Gradient…

Enhancing Adaptive Deep Networks for Image Classification via Uncertainty-aware Decision Fusion

Handling varying computational resources is a critical issue in modern AI applications. Adaptive deep networks, featuring the dynamic employment of multiple classifier heads among different layers, have been proposed to address…

Computer Vision and Pattern Recognition · Computer Science 2024-08-30 Xu Zhang , Zhipeng Xie , Haiyang Yu , Qitong Wang , Peng Wang , Wei Wang

Gradient Centralization: A New Optimization Technique for Deep Neural Networks

Optimization techniques are of great importance to effectively and efficiently train a deep neural network (DNN). It has been shown that using the first and second order statistics (e.g., mean and variance) to perform Z-score…

Computer Vision and Pattern Recognition · Computer Science 2020-04-09 Hongwei Yong , Jianqiang Huang , Xiansheng Hua , Lei Zhang

Data Centroid Based Multi-Level Fuzzy Min-Max Neural Network

Recently, a multi-level fuzzy min max neural network (MLF) was proposed, which improves the classification accuracy by handling an overlapped region (area of confusion) with the help of a tree structure. In this brief, an extension of MLF…

Artificial Intelligence · Computer Science 2016-12-21 Shraddha Deshmukh , Sagar Gandhi , Pratap Sanap , Vivek Kulkarni

Gradient Flow Matching for Learning Update Dynamics in Neural Network Training

Training deep neural networks remains computationally intensive due to the itera2 tive nature of gradient-based optimization. We propose Gradient Flow Matching (GFM), a continuous-time modeling framework that treats neural network training…

Machine Learning · Computer Science 2025-05-27 Xiao Shou , Yanna Ding , Jianxi Gao

Centroids Matching: an efficient Continual Learning approach operating in the embedding space

Catastrophic forgetting (CF) occurs when a neural network loses the information previously learned while training on a set of samples from a different distribution, i.e., a new task. Existing approaches have achieved remarkable results in…

Machine Learning · Computer Science 2022-09-13 Jary Pomponi , Simone Scardapane , Aurelio Uncini

Linear centralization classifier

A classification algorithm, called the Linear Centralization Classifier (LCC), is introduced. The algorithm seeks to find a transformation that best maps instances from the feature space to a space where they concentrate towards the center…

Machine Learning · Computer Science 2017-12-25 Mohammad Reza Bonyadi , Viktor Vegh , David C. Reutens

Stochastic Gradient Variance Reduction by Solving a Filtering Problem

Deep neural networks (DNN) are typically optimized using stochastic gradient descent (SGD). However, the estimation of the gradient using stochastic samples tends to be noisy and unreliable, resulting in large gradient variance and bad…

Machine Learning · Computer Science 2021-05-18 Xingyi Yang

Beyond Gradient Descent for Regularized Segmentation Losses

The simplicity of gradient descent (GD) made it the default method for training ever-deeper and complex neural networks. Both loss functions and architectures are often explicitly tuned to be amenable to this basic local optimization. In…

Machine Learning · Computer Science 2019-04-30 Dmitrii Marin , Meng Tang , Ismail Ben Ayed , Yuri Boykov

Centroid Decision Forest

This paper introduces the centroid decision forest (CDF), a novel ensemble learning framework that redefines the splitting strategy and tree building in the ordinary decision trees for high-dimensional classification. The splitting approach…

Machine Learning · Statistics 2026-01-09 Amjad Ali , Saeed Aldahmani , Hailiang Du , Zardad Khan

Decentralized Gradient-Free Methods for Stochastic Non-Smooth Non-Convex Optimization

We consider decentralized gradient-free optimization of minimizing Lipschitz continuous functions that satisfy neither smoothness nor convexity assumption. We propose two novel gradient-free algorithms, the Decentralized Gradient-Free…

Optimization and Control · Mathematics 2025-01-29 Zhenwei Lin , Jingfan Xia , Qi Deng , Luo Luo

Confederated Learning: Federated Learning with Decentralized Edge Servers

Federated learning (FL) is an emerging machine learning paradigm that allows to accomplish model training without aggregating data at a central server. Most studies on FL consider a centralized framework, in which a single server is endowed…

Machine Learning · Computer Science 2023-03-22 Bin Wang , Jun Fang , Hongbin Li , Xiaojun Yuan , Qing Ling

Sequential Feature Filtering Classifier

We propose Sequential Feature Filtering Classifier (FFC), a simple but effective classifier for convolutional neural networks (CNNs). With sequential LayerNorm and ReLU, FFC zeroes out low-activation units and preserves high-activation…

Computer Vision and Pattern Recognition · Computer Science 2020-06-23 Minseok Seo , Jaemin Lee , Jongchan Park , Dong-Geol Choi

Revisiting the Primal-Dual Method of Multipliers for Optimisation over Centralised Networks

The primal-dual method of multipliers (PDMM) was originally designed for solving a decomposable optimisation problem over a general network. In this paper, we revisit PDMM for optimisation over a centralized network. We first note that the…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-07-21 Guoqiang Zhang , Kenta Niwa , W. Bastiaan Kleijn

CF-CAM: Cluster Filter Class Activation Mapping for Reliable Gradient-Based Interpretability

As deep learning continues to advance, the transparency of neural network decision-making remains a critical challenge, limiting trust and applicability in high-stakes domains. Class Activation Mapping (CAM) techniques have emerged as a key…

Machine Learning · Computer Science 2025-04-24 Hongjie He , Xu Pan , Yudong Yao

Natural Gradient Descent for Online Continual Learning

Online Continual Learning (OCL) for image classification represents a challenging subset of Continual Learning, focusing on classifying images from a stream without assuming data independence and identical distribution (i.i.d). The primary…

Machine Learning · Computer Science 2026-03-24 Joe Khawand , David Colliaux

A Novel Structured Natural Gradient Descent for Deep Learning

Natural gradient descent (NGD) provided deep insights and powerful tools to deep neural networks. However the computation of Fisher information matrix becomes more and more difficult as the network structure turns large and complex. This…

Machine Learning · Computer Science 2021-09-22 Weihua Liu , Xiabi Liu

diffGrad: An Optimization Method for Convolutional Neural Networks

Stochastic Gradient Decent (SGD) is one of the core techniques behind the success of deep neural networks. The gradient provides information on the direction in which a function has the steepest rate of change. The main problem with basic…

Machine Learning · Computer Science 2021-11-30 Shiv Ram Dubey , Soumendu Chakraborty , Swalpa Kumar Roy , Snehasis Mukherjee , Satish Kumar Singh , Bidyut Baran Chaudhuri

Federated Learning for Face Recognition with Gradient Correction

With increasing appealing to privacy issues in face recognition, federated learning has emerged as one of the most prevalent approaches to study the unconstrained face recognition problem with private decentralized data. However,…

Computer Vision and Pattern Recognition · Computer Science 2021-12-15 Yifan Niu , Weihong Deng

Toward Efficient Federated Learning in Multi-Channeled Mobile Edge Network with Layerd Gradient Compression

A fundamental issue for federated learning (FL) is how to achieve optimal model performance under highly dynamic communication environments. This issue can be alleviated by the fact that modern edge devices usually can connect to the edge…

Machine Learning · Computer Science 2021-09-21 Haizhou Du , Xiaojie Feng , Qiao Xiang , Haoyu Liu

DeepCFD: Efficient Steady-State Laminar Flow Approximation with Deep Convolutional Neural Networks

Computational Fluid Dynamics (CFD) simulation by the numerical solution of the Navier-Stokes equations is an essential tool in a wide range of applications from engineering design to climate modeling. However, the computational cost and…

Computational Physics · Physics 2021-11-29 Mateus Dias Ribeiro , Abdul Rehman , Sheraz Ahmed , Andreas Dengel