Related papers: Optimizing Bivariate Partial Information Decomposi…

BROJA-2PID: A robust estimator for bivariate partial information decomposition

Makkeh, Theis, and Vicente found in [8] that Cone Programming model is the most robust to compute the Bertschinger et al. partial information decompostion (BROJA PID) measure [1]. We developed a production-quality robust software that…

Optimization and Control · Mathematics 2020-02-11 Abdullah Makkeh , Dirk Oliver Theis , Raul Vicente

Partial Information Decomposition via Deficiency for Multivariate Gaussians

Bivariate partial information decompositions (PIDs) characterize how the information in a "message" random variable is decomposed between two "constituent" random variables in terms of unique, redundant and synergistic information…

Information Theory · Computer Science 2023-07-21 Praveen Venkatesh , Gabriel Schamberg

The exact information-based complexity of smooth convex minimization

We obtain a new lower bound on the information-based complexity of first-order minimization of smooth and convex functions. We show that the bound matches the worst-case performance of the recently introduced Optimized Gradient Method,…

Optimization and Control · Mathematics 2016-06-07 Yoel Drori

On Robustness of the Normalized Subgradient Method with Randomly Corrupted Subgradients

Numerous modern optimization and machine learning algorithms rely on subgradient information being trustworthy and hence, they may fail to converge when such information is corrupted. In this paper, we consider the setting where subgradient…

Optimization and Control · Mathematics 2021-03-23 Berkay Turan , Cesar A. Uribe , Hoi-To Wai , Mahnoosh Alizadeh

Radial Subgradient Method

We present a subgradient method for minimizing non-smooth, non-Lipschitz convex optimization problems. The only structure assumed is that a strictly feasible point is known. We extend the work of Renegar [5] by taking a different…

Optimization and Control · Mathematics 2018-02-28 Benjamin Grimmer

Limitations of Information-Theoretic Generalization Bounds for Gradient Descent Methods in Stochastic Convex Optimization

To date, no "information-theoretic" frameworks for reasoning about generalization error have been shown to establish minimax rates for gradient descent in the setting of stochastic convex optimization. In this work, we consider the prospect…

Machine Learning · Computer Science 2023-07-19 Mahdi Haghifam , Borja Rodríguez-Gálvez , Ragnar Thobaben , Mikael Skoglund , Daniel M. Roy , Gintare Karolina Dziugaite

Bilevel Optimization with Lower-Level Uniform Convexity: Theory and Algorithm

Bilevel optimization is a hierarchical framework where an upper-level optimization problem is constrained by a lower-level problem, commonly used in machine learning applications such as hyperparameter optimization. Existing bilevel…

Optimization and Control · Mathematics 2026-03-03 Yuman Wu , Xiaochuan Gong , Jie Hao , Mingrui Liu

A Framework for Optimization under Limited Information

In many real world problems, optimization decisions have to be made with limited information. The decision maker may have no a priori or posteriori data about the often nonconvex objective function except from on a limited number of points…

Optimization and Control · Mathematics 2011-11-10 Tansu Alpcan

Universal subgradient and proximal bundle methods for convex and strongly convex hybrid composite optimization

This paper develops two parameter-free methods for solving convex and strongly convex hybrid composite optimization problems, namely, a composite subgradient type method and a proximal bundle type method. Functional complexity bounds for…

Optimization and Control · Mathematics 2025-11-24 Vincent Guigues , Jiaming Liang , Renato D. C. Monteiro

A Computation and Communication Efficient Method for Distributed Nonconvex Problems in the Partial Participation Setting

We present a new method that includes three key components of distributed optimization and federated learning: variance reduction of stochastic gradients, partial participation, and compressed communication. We prove that the new method has…

Machine Learning · Computer Science 2024-01-04 Alexander Tyurin , Peter Richtárik

PRISMA: PRoximal Iterative SMoothing Algorithm

Motivated by learning problems including max-norm regularized matrix completion and clustering, robust PCA and sparse inverse covariance selection, we propose a novel optimization algorithm for minimizing a convex objective which decomposes…

Optimization and Control · Mathematics 2012-11-20 Francesco Orabona , Andreas Argyriou , Nathan Srebro

Communication Lower Bounds for Distributed Convex Optimization: Partition Data on Features

Recently, there has been an increasing interest in designing distributed convex optimization algorithms under the setting where the data matrix is partitioned on features. Algorithms under this setting sometimes have many advantages over…

Machine Learning · Computer Science 2016-12-05 Zihao Chen , Luo Luo , Zhihua Zhang

An Optimal Hybrid Variance-Reduced Algorithm for Stochastic Composite Nonconvex Optimization

In this note we propose a new variant of the hybrid variance-reduced proximal gradient method in [7] to solve a common stochastic composite nonconvex optimization problem under standard assumptions. We simply replace the independent…

Optimization and Control · Mathematics 2020-08-21 Deyi Liu , Lam M. Nguyen , Quoc Tran-Dinh

A novel exact approach to polynomial optimization

Polynomial optimization problems represent a wide class of optimization problems, with a large number of real-world applications. Current approaches for polynomial optimization, such as the sum of squares (SOS) method, rely on large-scale…

Optimization and Control · Mathematics 2025-07-04 Dimitris Bertsimas , Dick den Hertog , Thodoris Koukouvinos

On the Convergence of SGD with Biased Gradients

We analyze the complexity of biased stochastic gradient methods (SGD), where individual updates are corrupted by deterministic, i.e. biased error terms. We derive convergence results for smooth (non-convex) functions and give improved rates…

Machine Learning · Computer Science 2021-05-11 Ahmad Ajalloeian , Sebastian U. Stich

An operational information decomposition via synergistic disclosure

Multivariate information decompositions hold promise to yield insight into complex systems, and stand out for their ability to identify synergistic phenomena. However, the adoption of these approaches has been hindered by there being…

Information Theory · Computer Science 2020-12-02 Fernando Rosas , Pedro Mediano , Borzoo Rassouli , Adam Barrett

Convex Optimization on Functionals of Probability Densities

In information theory, some optimization problems result in convex optimization problems on strictly convex functionals of probability densities. In this note, we study these problems and show conditions of minimizers and the uniqueness of…

Information Theory · Computer Science 2020-03-17 Tomohiro Nishiyama

Optimal and Practical Algorithms for Smooth and Strongly Convex Decentralized Optimization

We consider the task of decentralized minimization of the sum of smooth strongly convex functions stored across the nodes of a network. For this problem, lower bounds on the number of gradient computations and the number of communication…

Optimization and Control · Mathematics 2020-11-16 Dmitry Kovalev , Adil Salim , Peter Richtárik

Information-constrained optimization: can adaptive processing of gradients help?

We revisit first-order optimization under local information constraints such as local privacy, gradient quantization, and computational constraints limiting access to a few coordinates of the gradient. In this setting, the optimization…

Optimization and Control · Mathematics 2021-04-05 Jayadev Acharya , Clément L. Canonne , Prathamesh Mayekar , Himanshu Tyagi

A stochastic optimisation unadjusted Langevin method for empirical Bayesian estimation in semi-blind image deblurring problems

This paper presents a novel stochastic optimisation methodology to perform empirical Bayesian inference in semi-blind image deconvolution problems. Given a blurred image and a parametric class of possible operators, the proposed…

Applications · Statistics 2024-03-12 Charlesquin Kemajou Mbakam , Marcelo Pereyra , Jean-François Giovannelli