最优化与控制

Absorbing Markov Decision Processes: Geometric Properties and Sufficiency of Finite Mixtures of Deterministic Policies

In this paper we investigate several geometric properties of the set of occupancy measures. In particular, we analyse the structure of the faces generated by a given occupancy measure, together with their relative algebraic interior. We…

最优化与控制 · 数学 2025-12-22 Francois Dufour , Tomas Prieto-Rumeau

Distributed Rotary Coverage Control of Multi-Agent Systems in Uncertain Environments

It is always a challenging task for multi-agent systems to achieve efficient and robust coverage in uncertain environments. The absence of global positioning information on the uncertain environment introduces significant complexity to the…

最优化与控制 · 数学 2025-12-22 Chao Zhai , Yanlin Li

Fej\'er and Fej\'er* Monotonicity: New Results and Limiting Examples

Many algorithms in convex optimization and variational analysis can be analyzed using Fej\'er monotone sequences. In 2024, Behling, Bello-Cruz, Iusem, Alves Ribeiro, and Santos introduced a new, more general, notion: Fej\'er* monotonicity.…

最优化与控制 · 数学 2025-12-22 Aleksandr Arakcheev , Heinz H. Bauschke

EBIF: Exact Bilinearization Iterative Form for Control-Affine Nonlinear Systems

In this paper, we develop a novel framework, Exact Bilinearization Iterative Form (EBIF), for transforming a nonlinear control-affine system into an exact finite-dimensional bilinear representation. In contrast to most existing approaches…

最优化与控制 · 数学 2025-12-22 Yuan-Hung Kuan , Jr-Shin Li

Safeguarded Stochastic Polyak Step Sizes for Non-smooth Optimization: Robust Performance Without Small (Sub)Gradients

The stochastic Polyak step size (SPS) has proven to be a promising choice for stochastic gradient descent (SGD), delivering competitive performance relative to state-of-the-art methods on smooth convex and non-convex optimization problems,…

最优化与控制 · 数学 2025-12-22 Dimitris Oikonomou , Nicolas Loizou

Mathematical Analysis and Modeling of Ebola Virus Dynamics via Optimal Control and Neural Network Paradigms

Ebola virus disease is a severe hemorrhagic fever with rapid transmission through infected fluids and surfaces. We develop a fractional-order model using Caputo derivatives to capture memory effects in disease dynamics. An eight-compartment…

最优化与控制 · 数学 2025-12-22 Noor Muhammad , Md. Nur Alam , Zhang Shiqing

Quantum Alternating Direction Method of Multipliers for Semidefinite Programming

Semidefinite programming (SDP) is a fundamental convex optimization problem with wide-ranging applications. However, solving large-scale instances remains computationally challenging due to the high cost of solving linear systems and…

最优化与控制 · 数学 2025-12-22 Hantao Nie , Dong An , Zaiwen Wen

A measure-valued HJB perspective on Bayesian optimal adaptive control

We consider a Bayesian adaptive optimal stochastic control problem where a hidden static signal has a non-separable influence on the drift of a noisy observation. Being allowed to control the specific form of this dependence, we aim at…

最优化与控制 · 数学 2025-12-22 Alexander M. G. Cox , Sigrid Källblad , Chaorui Wang

Optimal Ratcheting of Dividends with Irreversible Reinsurance

This paper considers an insurance company that faces two key constraints: a ratcheting dividend constraint and an irreversible reinsurance constraint. The company allocates part of its reserve to pay dividends to its shareholders while…

最优化与控制 · 数学 2025-12-22 Tim J. Boonen , Engel John C. Dela Vega

Constructing Tight Quadratic Relaxations for Global Optimization: II. Underestimating Difference-of-Convex (D.C.) Functions

Recent advances in the efficiency and robustness of algorithms solving convex quadratically constrained quadratic programming (QCQP) problems motivate developing techniques for creating convex quadratic relaxations that, although more…

最优化与控制 · 数学 2025-12-22 William R. Strahl , Arvind U. Raghunathan , Nikolaos V. Sahinidis , Chrysanthos E. Gounaris

Constructing Tight Quadratic Relaxations for Global Optimization: I. Outer-Approximating Twice-Differentiable Convex Functions

When computing bounds, spatial branch-and-bound algorithms often linearly outer approximate convex relaxations for non-convex expressions in order to capitalize on the efficiency and robustness of linear programming solvers. Considering…

最优化与控制 · 数学 2025-12-22 William R. Strahl , Arvind U. Raghunathan , Nikolaos V. Sahinidis , Chrysanthos E. Gounaris

Unifying Distributionally Robust Optimization via Optimal Transport Theory

In recent years, two prominent paradigms have shaped distributionally robust optimization (DRO), modeling distributional ambiguity through $\phi$-divergences and Wasserstein distances, respectively. While the former focuses on ambiguity in…

最优化与控制 · 数学 2025-12-22 Jose Blanchet , Daniel Kuhn , Jiajin Li , Bahar Taskesen

On the ROF Model in Rectilinear Anisotropy: Piecewise Constant Approximation and Universal Minimality

We prove that the $L^2$ distance between the minimizer of the $\ell^1$-anisotropic Rudin-Osher-Fatemi (ROF) functional and its minimizer over the space of piecewise constant functions on a rectilinear grid is $\mathcal{O}(h^{\frac12 -…

最优化与控制 · 数学 2025-12-22 Clemens Kirisits , Eric Setterqvist

A survey of the orienteering problem: model evolution, algorithmic advances, and future directions

The orienteering problem (OP) is a combinatorial optimization problem that seeks a path visiting a subset of locations to maximize collected rewards under a limited resource budget. This article presents a systematic PRISMA-based review of…

最优化与控制 · 数学 2025-12-19 Songhao Shen , Yufeng Zhou , Qin Lei , Zhibin Wu

Lower bounds for ranking-based pivot rules

The existence of a polynomial pivot rule for the simplex method for linear programming, policy iteration for Markov decision processes, and strategy improvement for parity games each are prominent open problems in their respective fields.…

最优化与控制 · 数学 2025-12-19 Yann Disser , Georg Loho , Matthew Maat , Nils Mosis

The Bi-objective Electric Autonomous Dial-a-Ride Problem

The electric autonomous dial-a-ride problem (E-ADARP) introduces electric, autonomously driving vehicles and their unique requirements into the classic dial-a-ride problem, where people are transported between pickup and drop-off locations.…

最优化与控制 · 数学 2025-12-19 Yue Su , Sophie N. Parragh , Nicolas Dupin , Jakob Puchinger

Muon is Provably Faster with Momentum Variance Reduction

Recent empirical research has demonstrated that deep learning optimizers based on the linear minimization oracle (LMO) over specifically chosen Non-Euclidean norm balls, such as Muon and Scion, outperform Adam-type methods in the training…

最优化与控制 · 数学 2025-12-19 Xun Qian , Hussein Rammal , Dmitry Kovalev , Peter Richtárik

Non-Asymptotic Global Convergence of PPO-Clip

Reinforcement learning (RL) has gained attention for aligning large language models (LLMs) via reinforcement learning from human feedback (RLHF). The actor-only variants of Proximal Policy Optimization (PPO) are widely applied for their…

最优化与控制 · 数学 2025-12-19 Yin Liu , Qiming Dai , Junyu Zhang , Zaiwen Wen

Classical solution to second-order Hamilton-Jacobi-Bellman equation and optimal feedback control for linear-convex problem

In this paper, we are concerned with the classical solvability of a class of second-order Hamilton-Jacobi-Bellman equations (HJB equations) arising from stochastic optimal control problems with linear dynamics and uniformly convex cost…

最优化与控制 · 数学 2025-12-19 Jinghua Li , Zhiyong Yu

Existence of Solutions for Non-monotone VIs and Implications for Games

In this paper, we study the existence of solutions in non-monotone variational inequalities (VIs) through the normal mapping properties. In particular, we show that when the normal mapping $F_K^{\rm nor}(\cdot)$ is norm coercive over a set…

最优化与控制 · 数学 2025-12-19 Sina Arefizadeh , Angelia Nedić