Related papers: A Cost-based Optimizer for Gradient Descent Optimi…

Optimizing ML Training with Metagradient Descent

A major challenge in training large-scale machine learning models is configuring the training process to maximize model performance, i.e., finding the best training setup from a vast design space. In this work, we unlock a gradient-based…

Machine Learning · Statistics 2025-03-19 Logan Engstrom , Andrew Ilyas , Benjamin Chen , Axel Feldmann , William Moses , Aleksander Madry

Learning complexity of gradient descent and conjugate gradient algorithms

Gradient Descent (GD) and Conjugate Gradient (CG) methods are among the most effective iterative algorithms for solving unconstrained optimization problems, particularly in machine learning and statistical modeling, where they are employed…

Optimization and Control · Mathematics 2024-12-19 Xianqi Jiao , Jia Liu , Zhiping Chen

Costing Generated Runtime Execution Plans for Large-Scale Machine Learning Programs

Declarative large-scale machine learning (ML) aims at the specification of ML algorithms in a high-level language and automatic generation of hybrid runtime execution plans ranging from single node, in-memory computations to distributed…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-03-24 Matthias Boehm

Generalization to the Natural Gradient Descent

Optimization problem, which is aimed at finding the global minimal value of a given cost function, is one of the central problem in science and engineering. Various numerical methods have been proposed to solve this problem, among which the…

Optimization and Control · Mathematics 2022-10-07 Shaojun Dong , Fengyu Le , Meng Zhang , Si-Jing Tao , Chao Wang , Yong-Jian Han , Guo-Ping Guo

Gradient Descent Methods for Regularized Optimization

Regularization is a widely recognized technique in mathematical optimization. It can be used to smooth out objective functions, refine the feasible solution set, or prevent overfitting in machine learning models. Due to its simplicity and…

Optimization and Control · Mathematics 2024-12-31 Filip Nikolovski , Irena Stojkovska , Katerina Hadzi-Velkova Saneva , Zoran Hadzi-Velkov

Balancing Communication and Computation in Distributed Optimization

Methods for distributed optimization have received significant attention in recent years owing to their wide applicability in various domains. A distributed optimization method typically consists of two key components: communication and…

Optimization and Control · Mathematics 2018-06-04 Albert S. Berahas , Raghu Bollapragada , Nitish Shirish Keskar , Ermin Wei

Decision-Oriented Learning Using Differentiable Submodular Maximization for Multi-Robot Coordination

We present a differentiable, decision-oriented learning framework for cost prediction in a class of multi-robot decision-making problems, in which the robots need to trade off the task performance with the costs of taking actions when they…

Robotics · Computer Science 2024-03-27 Guangyao Shi , Chak Lam Shek , Nare Karapetyan , Pratap Tokekar

A New Perspective of Accelerated Gradient Methods: The Controlled Invariant Manifold Approach

Gradient Descent (GD) is a ubiquitous algorithm for finding the optimal solution to an optimization problem. For reduced computational complexity, the optimal solution $\mathrm{x^*}$ of the optimization problem must be attained in a minimum…

Optimization and Control · Mathematics 2023-06-01 Revati Gunjal , Sushama Wagh , Syed Shadab Nayyer , Alex Stankovic , Navdeep M. Singh

Gradient-Based Mixed Planning with Symbolic and Numeric Action Parameters

Dealing with planning problems with both logical relations and numeric changes in real-world dynamic environments is challenging. Existing numeric planning systems for the problem often discretize numeric variables or impose convex…

Artificial Intelligence · Computer Science 2022-10-11 Kebing Jin , Hankz Hankui Zhuo , Zhanhao Xiao , Hai Wan , Subbarao Kambhampati

AutoGD: Automatic Learning Rate Selection for Gradient Descent

The performance of gradient-based optimization methods, such as standard gradient descent (GD), greatly depends on the choice of learning rate. However, it can require a non-trivial amount of user tuning effort to select an appropriate…

Machine Learning · Computer Science 2025-10-14 Nikola Surjanovic , Alexandre Bouchard-Côté , Trevor Campbell

Adaptive Sequential Optimization with Applications to Machine Learning

A framework is introduced for solving a sequence of slowly changing optimization problems, including those arising in regression and classification applications, using optimization algorithms such as stochastic gradient descent (SGD). The…

Machine Learning · Computer Science 2015-09-25 Craig Wilson , Venugopal V. Veeravalli

Learning to Optimize Under Constraints with Unsupervised Deep Neural Networks

In this paper, we propose a machine learning (ML) method to learn how to solve a generic constrained continuous optimization problem. To the best of our knowledge, the generic methods that learn to optimize, focus on unconstrained…

Machine Learning · Computer Science 2021-01-05 Seyedrazieh Bayati , Faramarz Jabbarvaziri

LLM as a Complementary Optimizer to Gradient Descent: A Case Study in Prompt Tuning

Mastering a skill generally relies on both hands-on experience from doers and insightful, high-level guidance by mentors. Will this strategy also work well for solving complex non-convex optimization problems? Here, a common gradient-based…

Computer Vision and Pattern Recognition · Computer Science 2024-12-05 Zixian Guo , Ming Liu , Zhilong Ji , Jinfeng Bai , Yiwen Guo , Wangmeng Zuo

Adaptive Sequential Machine Learning

A framework previously introduced in [3] for solving a sequence of stochastic optimization problems with bounded changes in the minimizers is extended and applied to machine learning problems such as regression and classification. The…

Machine Learning · Computer Science 2019-04-08 Craig Wilson , Yuheng Bu , Venugopal Veeravalli

Accelerating Ill-Conditioned Low-Rank Matrix Estimation via Scaled Gradient Descent

Low-rank matrix estimation is a canonical problem that finds numerous applications in signal processing, machine learning and imaging science. A popular approach in practice is to factorize the matrix into two compact low-rank factors, and…

Machine Learning · Computer Science 2021-06-16 Tian Tong , Cong Ma , Yuejie Chi

Gradient Correction beyond Gradient Descent

The great success neural networks have achieved is inseparable from the application of gradient-descent (GD) algorithms. Based on GD, many variant algorithms have emerged to improve the GD optimization process. The gradient for…

Machine Learning · Computer Science 2023-05-29 Zefan Li , Bingbing Ni , Teng Li , WenJun Zhang , Wen Gao

Decision-Focused Learning to Predict Action Costs for Planning

In many automated planning applications, action costs can be hard to specify. An example is the time needed to travel through a certain road segment, which depends on many factors, such as the current weather conditions. A natural way to…

Artificial Intelligence · Computer Science 2024-08-27 Jayanta Mandi , Marco Foschini , Daniel Holler , Sylvie Thiebaux , Jorg Hoffmann , Tias Guns

Efficient Distance Metric Learning by Adaptive Sampling and Mini-Batch Stochastic Gradient Descent (SGD)

Distance metric learning (DML) is an important task that has found applications in many domains. The high computational cost of DML arises from the large number of variables to be determined and the constraint that a distance metric has to…

Machine Learning · Computer Science 2013-04-05 Qi Qian , Rong Jin , Jinfeng Yi , Lijun Zhang , Shenghuo Zhu

Towards Differentiable Multilevel Optimization: A Gradient-Based Approach

Multilevel optimization has gained renewed interest in machine learning due to its promise in applications such as hyperparameter tuning and continual learning. However, existing methods struggle with the inherent difficulty of efficiently…

Machine Learning · Computer Science 2024-10-16 Yuntian Gu , Xuzheng Chen

Manifold Free Riemannian Optimization

Riemannian optimization is a principled framework for solving optimization problems where the desired optimum is constrained to a smooth manifold $\mathcal{M}$. Algorithms designed in this framework usually require some geometrical…

Optimization and Control · Mathematics 2022-09-08 Boris Shustin , Haim Avron , Barak Sober