English
Related papers

Related papers: Competitive Gradient Optimization

200 papers

Two of the most prominent algorithms for solving unconstrained smooth games are the classical stochastic gradient descent-ascent (SGDA) and the recently introduced stochastic consensus optimization (SCO) [Mescheder et al., 2017]. SGDA is…

Machine Learning · Computer Science 2021-11-05 Nicolas Loizou , Hugo Berard , Gauthier Gidel , Ioannis Mitliagkas , Simon Lacoste-Julien

We study the convergence of Optimistic Gradient Descent Ascent in unconstrained bilinear games. In a first part, we consider the zero-sum case and extend previous results by Daskalakis et al. in 2018, Liang and Stokes in 2019, and others:…

Optimization and Control · Mathematics 2022-11-24 Étienne de Montbrun , Jérôme Renault

Convergence to a saddle point for convex-concave functions has been studied for decades, while recent years has seen a surge of interest in non-convex (zero-sum) smooth games, motivated by their recent wide applications. It remains an…

Machine Learning · Computer Science 2022-02-04 Guojun Zhang , Pascal Poupart , Yaoliang Yu

A core challenge in policy optimization in competitive Markov decision processes is the design of efficient optimization methods with desirable convergence and stability properties. To tackle this, we propose competitive policy optimization…

Machine Learning · Computer Science 2020-06-19 Manish Prajapat , Kamyar Azizzadenesheli , Alexander Liniger , Yisong Yue , Anima Anandkumar

In this paper, we provide a novel analytical perspective on the theoretical understanding of gradient-based learning algorithms by interpreting consensus-based optimization (CBO), a recently proposed multi-particle derivative-free…

Machine Learning · Computer Science 2026-03-02 Konstantin Riedl , Timo Klock , Carina Geldhauser , Massimo Fornasier

We study the alternating gradient descent-ascent (AltGDA) algorithm in two-player zero-sum games. Alternating methods, where players take turns to update their strategies, have long been recognized as simple and practical approaches for…

Computer Science and Game Theory · Computer Science 2026-03-03 Tianlong Nan , Shuvomoy Das Gupta , Garud Iyengar , Christian Kroer

Motivated by the pursuit of a systematic computational and algorithmic understanding of Generative Adversarial Networks (GANs), we present a simple yet unified non-asymptotic local convergence theory for smooth two-player games, which…

Machine Learning · Statistics 2020-07-27 Tengyuan Liang , James Stokes

In this paper we consider solving saddle point problems using two variants of Gradient Descent-Ascent algorithms, Extra-gradient (EG) and Optimistic Gradient Descent Ascent (OGDA) methods. We show that both of these algorithms admit a…

Optimization and Control · Mathematics 2019-09-06 Aryan Mokhtari , Asuman Ozdaglar , Sarath Pattathil

Centered around solving the Online Saddle Point problem, this paper introduces the Online Convex-Concave Optimization (OCCO) framework, which involves a sequence of two-player time-varying convex-concave games. We propose the generalized…

Machine Learning · Computer Science 2023-12-18 Qing-xin Meng , Jian-wei Liu

The rapid progress in machine learning in recent years has been based on a highly productive connection to gradient-based optimization. Further progress hinges in part on a shift in focus from pattern recognition to decision-making and…

Machine Learning · Computer Science 2024-02-27 Neha S. Wadia , Yatin Dandi , Michael I. Jordan

Graph matching aims to find correspondences between two graphs. This paper integrates several well-known graph matching algorithms into a framework: the constrained gradient method. The primary difference among these algorithms lies in…

Combinatorics · Mathematics 2024-12-11 Binrui Shen , Qiang Niu , Shengxin Zhu

Many economic games and machine learning approaches can be cast as competitive optimization problems where multiple agents are minimizing their respective objective function, which depends on all agents' actions. While gradient descent is a…

Machine Learning · Computer Science 2021-11-17 Jeffrey Ma , Alistair Letcher , Florian Schäfer , Yuanyuan Shi , Anima Anandkumar

We consider differentiable games where the goal is to find a Nash equilibrium. The machine learning community has recently started using variants of the gradient method (GD). Prime examples are extragradient (EG), the optimistic gradient…

Machine Learning · Computer Science 2020-07-08 Waïss Azizian , Ioannis Mitliagkas , Simon Lacoste-Julien , Gauthier Gidel

We introduce a new algorithm for the numerical computation of Nash equilibria of competitive two-player games. Our method is a natural generalization of gradient descent to the two-player setting where the update is given by the Nash…

Optimization and Control · Mathematics 2020-07-02 Florian Schäfer , Anima Anandkumar

The study of learning in games has thus far focused primarily on normal form games. In contrast, our understanding of learning in extensive form games (EFGs) and particularly in EFGs with many agents lags far behind, despite them being…

Computer Science and Game Theory · Computer Science 2022-07-19 Georgios Piliouras , Lillian Ratliff , Ryann Sim , Stratis Skoulakis

Stochastic min-max optimization has gained interest in the machine learning community with the advancements in GANs and adversarial training. Although game optimization is fairly well understood in the deterministic setting, some issues…

Machine Learning · Computer Science 2024-03-27 Juan Ramirez , Rohan Sukumaran , Quentin Bertrand , Gauthier Gidel

We study the performance of the gradient play algorithm for stochastic games (SGs), where each agent tries to maximize its own total discounted reward by making decisions independently based on current state information which is shared…

Machine Learning · Computer Science 2023-12-08 Runyu Zhang , Zhaolin Ren , Na Li

To solve the Unmanned Aerial Vehicle (UAV) path planning problem, a meta-heuristic optimization algorithm called competitive game optimizer (CGO) is proposed. In the CGO model, three phases of exploration and exploitation, and candidate…

Systems and Control · Electrical Eng. & Systems 2024-04-16 Tai-shan Lou , Guang-sheng Guan , Zhe-peng Yue , Yu Wang , Ren-long Qi , Shi-hao Tong

The two-timescale gradient descent-ascent (GDA) is a canonical gradient algorithm designed to find Nash equilibria in min-max games. We analyze the two-timescale GDA by investigating the effects of learning rate ratios on convergence…

Optimization and Control · Mathematics 2025-10-13 Jing An , Jianfeng Lu

Under mild regularity conditions, gradient-based methods converge globally to a critical point in the single-loss setting. This is known to break down for vanilla gradient descent when moving to multi-loss optimization, but can we hope to…

Optimization and Control · Mathematics 2021-01-19 Alistair Letcher
‹ Prev 1 2 3 10 Next ›