Related papers: Neural Gradient Learning and Optimization for Orie…

NeuralGF: Unsupervised Point Normal Estimation by Learning Neural Gradient Function

Normal estimation for 3D point clouds is a fundamental task in 3D geometry processing. The state-of-the-art methods rely on priors of fitting local surfaces learned from normal supervision. However, normal supervision in benchmarks comes…

Computer Vision and Pattern Recognition · Computer Science 2023-11-02 Qing Li , Huifang Feng , Kanle Shi , Yue Gao , Yi Fang , Yu-Shen Liu , Zhizhong Han

Optimistic Gradient Learning with Hessian Corrections for High-Dimensional Black-Box Optimization

Black-box algorithms are designed to optimize functions without relying on their underlying analytical structure or gradient information, making them essential when gradients are inaccessible or difficult to compute. Traditional methods for…

Machine Learning · Computer Science 2026-01-21 Yedidya Kfir , Elad Sarafian , Sarit Kraus , Yoram Louzoun

ONG: Orthogonal Natural Gradient Descent

Orthogonal Gradient Descent (OGD) has emerged as a powerful method for continual learning. However, its Euclidean projections do not leverage the underlying information-geometric structure of the problem, which can lead to suboptimal…

Machine Learning · Computer Science 2025-12-09 Yajat Yadav , Patrick Mendoza , Jathin Korrapati

Orthogonal Gradient Descent for Continual Learning

Neural networks are achieving state of the art and sometimes super-human performance on learning tasks across a variety of domains. Whenever these problems require learning in a continual or sequential manner, however, neural networks…

Machine Learning · Computer Science 2019-10-17 Mehrdad Farajtabar , Navid Azizan , Alex Mott , Ang Li

Learning Provably Improves the Convergence of Gradient Descent

Learn to Optimize (L2O) trains deep neural network-based solvers for optimization, achieving success in accelerating convex problems and improving non-convex solutions. However, L2O lacks rigorous theoretical backing for its own training…

Machine Learning · Computer Science 2025-12-24 Qingyu Song , Wei Lin , Hong Xu

Local Grid Rendering Networks for 3D Object Detection in Point Clouds

The performance of 3D object detection models over point clouds highly depends on their capability of modeling local geometric patterns. Conventional point-based models exploit local patterns through a symmetric function (e.g. max pooling)…

Computer Vision and Pattern Recognition · Computer Science 2020-07-07 Jianan Li , Jiashi Feng

Approximation Results for Gradient Descent trained Neural Networks

The paper contains approximation guarantees for neural networks that are trained with gradient flow, with error measured in the continuous $L_2(\mathbb{S}^{d-1})$-norm on the $d$-dimensional unit sphere and targets that are Sobolev smooth.…

Machine Learning · Computer Science 2023-09-12 G. Welper

Natural gradient descent with momentum

We consider the problem of approximating a function by an element of a nonlinear manifold which admits a differentiable parametrization, typical examples being neural networks with differentiable activation functions or tensor networks.…

Machine Learning · Computer Science 2026-04-20 Anthony Nouy , Agustín Somacal

How Does Gradient Descent Learn Features -- A Local Analysis for Regularized Two-Layer Neural Networks

The ability of learning useful features is one of the major advantages of neural networks. Although recent works show that neural network can operate in a neural tangent kernel (NTK) regime that does not allow feature learning, many works…

Machine Learning · Computer Science 2024-11-06 Mo Zhou , Rong Ge

Noise-Adaptive Layerwise Learning Rates: Accelerating Geometry-Aware Optimization for Deep Neural Network Training

Geometry-aware optimization algorithms, such as Muon, have achieved remarkable success in training deep neural networks (DNNs). These methods leverage the underlying geometry of DNNs by selecting appropriate norms for different layers and…

Machine Learning · Computer Science 2026-02-04 Jie Hao , Xiaochuan Gong , Jie Xu , Zhengdao Wang , Mingrui Liu

Adaptive Gradient Regularization: A Faster and Generalizable Optimization Technique for Deep Neural Networks

Stochastic optimization plays a crucial role in the advancement of deep learning technologies. Over the decades, significant effort has been dedicated to improving the training efficiency and robustness of deep neural networks, via various…

Machine Learning · Computer Science 2024-08-21 Huixiu Jiang , Ling Yang , Yu Bao , Rutong Si , Sikun Yang

Learning Normals of Noisy Points by Local Gradient-Aware Surface Filtering

Estimating normals for noisy point clouds is a persistent challenge in 3D geometry processing, particularly for end-to-end oriented normal estimation. Existing methods generally address relatively clean data and rely on supervised priors to…

Computer Vision and Pattern Recognition · Computer Science 2025-11-27 Qing Li , Huifang Feng , Xun Gong , Yu-Shen Liu

LRGNet: Learnable Region Growing for Class-Agnostic Point Cloud Segmentation

3D point cloud segmentation is an important function that helps robots understand the layout of their surrounding environment and perform tasks such as grasping objects, avoiding obstacles, and finding landmarks. Current segmentation…

Computer Vision and Pattern Recognition · Computer Science 2021-03-17 Jingdao Chen , Zsolt Kira , Yong K. Cho

Gradient Projection Memory for Continual Learning

The ability to learn continually without forgetting the past tasks is a desired attribute for artificial learning systems. Existing approaches to enable such learning in artificial neural networks usually rely on network growth, importance…

Machine Learning · Computer Science 2021-03-18 Gobinda Saha , Isha Garg , Kaushik Roy

NeAF: Learning Neural Angle Fields for Point Normal Estimation

Normal estimation for unstructured point clouds is an important task in 3D computer vision. Current methods achieve encouraging results by mapping local patches to normal vectors or learning local surface fitting using neural networks.…

Computer Vision and Pattern Recognition · Computer Science 2022-12-01 Shujuan Li , Junsheng Zhou , Baorui Ma , Yu-Shen Liu , Zhizhong Han

Auto-Unrolled Proximal Gradient Descent: An AutoML Approach to Interpretable Waveform Optimization

This study explores the combination of automated machine learning (AutoML) with model-based deep unfolding (DU) for optimizing wireless beamforming and waveforms. We convert the iterative proximal gradient descent (PGD) algorithm into a…

Machine Learning · Computer Science 2026-04-23 Ahmet Kaplan

A block coordinate descent optimizer for classification problems exploiting convexity

Second-order optimizers hold intriguing potential for deep learning, but suffer from increased cost and sensitivity to the non-convexity of the loss surface as compared to gradient-based approaches. We introduce a coordinate descent method…

Machine Learning · Computer Science 2020-06-19 Ravi G. Patel , Nathaniel A. Trask , Mamikon A. Gulian , Eric C. Cyr

Convergence Analysis of Natural Gradient Descent for Over-parameterized Physics-Informed Neural Networks

In the context of over-parameterization, there is a line of work demonstrating that randomly initialized (stochastic) gradient descent (GD) converges to a globally optimal solution at a linear convergence rate for the quadratic loss…

Machine Learning · Computer Science 2025-06-16 Xianliang Xu , Ting Du , Wang Kong , Bin Shan , Ye Li , Zhongyi Huang

Gradient Monitored Reinforcement Learning

This paper presents a novel neural network training approach for faster convergence and better generalization abilities in deep reinforcement learning. Particularly, we focus on the enhancement of training and evaluation performance in…

Machine Learning · Computer Science 2020-05-26 Mohammed Sharafath Abdul Hameed , Gavneet Singh Chadha , Andreas Schwung , Steven X. Ding

Gradient Routing: Masking Gradients to Localize Computation in Neural Networks

Neural networks are trained primarily based on their inputs and outputs, without regard for their internal mechanisms. These neglected mechanisms determine properties that are critical for safety, like (i) transparency; (ii) the absence of…

Machine Learning · Computer Science 2024-12-02 Alex Cloud , Jacob Goldman-Wetzler , Evžen Wybitul , Joseph Miller , Alexander Matt Turner