Related papers: Solving optimization problems with Blackwell appro…

Conic Blackwell Algorithm: Parameter-Free Convex-Concave Saddle-Point Solving

We develop new parameter-free and scale-free algorithms for solving convex-concave saddle-point problems. Our results are based on a new simple regret minimizer, the Conic Blackwell Algorithm$^+$ (CBA$^+$), which attains $O(1/\sqrt{T})$…

Machine Learning · Computer Science 2021-10-15 Julien Grand-Clément , Christian Kroer

Faster Game Solving via Predictive Blackwell Approachability: Connecting Regret Matching and Mirror Descent

Blackwell approachability is a framework for reasoning about repeated games with vector-valued payoffs. We introduce predictive Blackwell approachability, where an estimate of the next payoff vector is given, and the decision maker tries to…

Computer Science and Game Theory · Computer Science 2021-03-09 Gabriele Farina , Christian Kroer , Tuomas Sandholm

Extensive-Form Game Solving via Blackwell Approachability on Treeplexes

In this paper, we introduce the first algorithmic framework for Blackwell approachability on the sequence-form polytope, the class of convex polytopes capturing the strategies of players in extensive-form games (EFGs). This leads to a new…

Computer Science and Game Theory · Computer Science 2024-03-08 Darshan Chakrabarti , Julien Grand-Clément , Christian Kroer

Comparison-Based Algorithms for One-Dimensional Stochastic Convex Optimization

Stochastic optimization finds a wide range of applications in operations research and management science. However, existing stochastic optimization techniques usually require the information of random samples (e.g., demands in the…

Optimization and Control · Mathematics 2019-04-18 Xi Chen , Qihang Lin , Zizhuo Wang

Algorithms for stochastic optimization with functional or expectation constraints

This paper considers the problem of minimizing an expectation function over a closed convex set, coupled with a {\color{black} functional or expectation} constraint on either decision variables or problem parameters. We first present a new…

Optimization and Control · Mathematics 2020-10-05 Guanghui Lan , Zhiqiang Zhou

A stochastic moving ball approximation method for smooth convex constrained minimization

In this paper, we consider constrained optimization problems with convex, smooth objective and constraints. We propose a new stochastic gradient algorithm, called the Stochastic Moving Ball Approximation (SMBA) method, to solve this class…

Optimization and Control · Mathematics 2024-12-03 Nitesh Kumar Singh , Ion Necoara

Consensus-Based Optimization for Saddle Point Problems

In this paper, we propose consensus-based optimization for saddle point problems (CBO-SP), a novel multi-particle metaheuristic derivative-free optimization method capable of provably finding global Nash equilibria. Following the idea of…

Optimization and Control · Mathematics 2024-08-05 Hui Huang , Jinniao Qiu , Konstantin Riedl

Closing the Gaps: Optimality of Sample Average Approximation for Data-Driven Newsvendor Problems

We study the regret performance of Sample Average Approximation (SAA) for data-driven newsvendor problems with general convex inventory costs. In literature, the optimality of SAA has not been fully established under both \alpha-global…

Machine Learning · Computer Science 2024-07-09 Jiameng Lyu , Shilin Yuan , Bingkun Zhou , Yuan Zhou

Achieving Tractable Minimax Optimal Regret in Average Reward MDPs

In recent years, significant attention has been directed towards learning average-reward Markov Decision Processes (MDPs). However, existing algorithms either suffer from sub-optimal regret guarantees or computational inefficiencies. In…

Machine Learning · Computer Science 2024-06-04 Victor Boone , Zihan Zhang

The Online Saddle Point Problem and Online Convex Optimization with Knapsacks

We study the online saddle point problem, an online learning problem where at each iteration a pair of actions need to be chosen without knowledge of the current and future (convex-concave) payoff functions. The objective is to minimize the…

Machine Learning · Statistics 2020-04-07 Adrian Rivera , He Wang , Huan Xu

Approachability, Regret and Calibration; implications and equivalences

Blackwell approachability, regret minimization and calibration are three criteria evaluating a strategy (or an algorithm) in different sequential decision problems, or repeated games between a player and Nature. Although they have at first…

Computer Science and Game Theory · Computer Science 2013-01-15 Vianney Perchet

Non-stationary Bandit Convex Optimization: A Comprehensive Study

Bandit Convex Optimization is a fundamental class of sequential decision-making problems, where the learner selects actions from a continuous domain and observes a loss (but not its gradient) at only one point per round. We study this…

Machine Learning · Statistics 2025-12-02 Xiaoqi Liu , Dorian Baudry , Julian Zimmert , Patrick Rebeschini , Arya Akhavan

Swap Regret Minimization Through Response-Based Approachability

We consider the problem of minimizing different notions of swap regret in online optimization. These forms of regret are tightly connected to correlated equilibrium concepts in games, and have been more recently shown to guarantee…

Machine Learning · Computer Science 2026-05-22 Ioannis Anagnostides , Gabriele Farina , Maxwell Fishelson , Haipeng Luo , Jon Schneider

Pseudonorm Approachability and Applications to Regret Minimization

Blackwell's celebrated approachability theory provides a general framework for a variety of learning problems, including regret minimization. However, Blackwell's proof and implicit algorithm measure approachability using the $\ell_2$…

Machine Learning · Computer Science 2023-02-06 Christoph Dann , Yishay Mansour , Mehryar Mohri , Jon Schneider , Balasubramanian Sivan

Combinatorial Optimization Algorithms via Polymorphisms

An elegant characterization of the complexity of constraint satisfaction problems has emerged in the form of the the algebraic dichotomy conjecture of [BKJ00]. Roughly speaking, the characterization asserts that a CSP {\Lambda} is tractable…

Computational Complexity · Computer Science 2015-01-08 Jonah Brown-Cohen , Prasad Raghavendra

Response-Based Approachability and its Application to Generalized No-Regret Algorithms

Approachability theory, introduced by Blackwell (1956), provides fundamental results on repeated games with vector-valued payoffs, and has been usefully applied since in the theory of learning in games and to learning algorithms in the…

Machine Learning · Computer Science 2013-12-31 Andrey Bernstein , Nahum Shimkin

An Efficient Interior-Point Method for Online Convex Optimization

A new algorithm for regret minimization in online convex optimization is described. The regret of the algorithm after $T$ time periods is $O(\sqrt{T \log T})$ - which is the minimum possible up to a logarithmic term. In addition, the new…

Machine Learning · Computer Science 2023-07-24 Elad Hazan , Nimrod Megiddo

Optimal Regret Algorithm for Pseudo-1d Bandit Convex Optimization

We study online learning with bandit feedback (i.e. learner has access to only zeroth-order oracle) where cost/reward functions $\f_t$ admit a "pseudo-1d" structure, i.e. $\f_t(\w) = \loss_t(\pred_t(\w))$ where the output of $\pred_t$ is…

Machine Learning · Computer Science 2021-02-16 Aadirupa Saha , Nagarajan Natarajan , Praneeth Netrapalli , Prateek Jain

An Online Convex Optimization Approach to Blackwell's Approachability

The notion of approachability in repeated games with vector payoffs was introduced by Blackwell in the 1950s, along with geometric conditions for approachability and corresponding strategies that rely on computing {\em steering directions}…

Computer Science and Game Theory · Computer Science 2015-03-03 Nahum Shimkin

BAGEL: Projection-Free Algorithm for Adversarially Constrained Online Convex Optimization

Projection-based algorithms for Constrained Online Convex Optimization (COCO) achieve optimal $\mathcal{O}(T^{1/2})$ regret guarantees but face scalability challenges due to the computational complexity of projections. To circumvent this,…

Machine Learning · Computer Science 2026-01-29 Yiyang Lu , Mohammad Pedramfar , Vaneet Aggarwal