Related papers: Autoregressive Policy Optimization for Constrained…

Simplex Decomposition for Portfolio Allocation Constraints in Reinforcement Learning

Portfolio optimization tasks describe sequential decision problems in which the investor's wealth is distributed across a set of assets. Allocation constraints are used to enforce minimal or maximal investments into particular subsets of…

Artificial Intelligence · Computer Science 2024-04-17 David Winkel , Niklas Strauß , Matthias Schubert , Thomas Seidl

Off-Policy Optimization of Portfolio Allocation Policies under Constraints

The dynamic portfolio optimization problem in finance frequently requires learning policies that adhere to various constraints, driven by investor preferences and risk. We motivate this problem of finding an allocation policy within a…

Artificial Intelligence · Computer Science 2020-12-23 Nymisha Bandi , Theja Tulabandhula

Optimal Constrained Resource Allocation Strategies under Low Risk Circumstances

In this paper we consider multiple constrained resource allocation problems, where the constraints can be specified by formulating activity dependency restrictions or by using game-theoretic models. All the problems are focused on generic…

Data Structures and Algorithms · Computer Science 2009-06-19 Mugurel Ionut Andreica , Madalina Ecaterina Andreica , Costel Visan

Learning policies for resource allocation in business processes

Efficient allocation of resources to activities is pivotal in executing business processes but remains challenging. While resource allocation methodologies are well-established in domains like manufacturing, their application within…

Artificial Intelligence · Computer Science 2024-11-19 J. Middelhuis , R. Lo Bianco , E. Scherzer , Z. A. Bukhsh , I. J. B. F. Adan , R. M. Dijkman

Distributional constrained reinforcement learning for supply chain optimization

This work studies reinforcement learning (RL) in the context of multi-period supply chains subject to constraints, e.g., on production and inventory. We introduce Distributional Constrained Policy Optimization (DCPO), a novel approach for…

Machine Learning · Computer Science 2023-02-06 Jaime Sabal Bermúdez , Antonio del Rio Chanona , Calvin Tsay

Reward Constrained Policy Optimization

Solving tasks in Reinforcement Learning is no easy feat. As the goal of the agent is to maximize the accumulated reward, it often learns to exploit loopholes and misspecifications in the reward signal resulting in unwanted behavior. While…

Machine Learning · Computer Science 2018-12-27 Chen Tessler , Daniel J. Mankowitz , Shie Mannor

Adversarial Constrained Policy Optimization: Improving Constrained Reinforcement Learning by Adapting Budgets

Constrained reinforcement learning has achieved promising progress in safety-critical fields where both rewards and constraints are considered. However, constrained reinforcement learning methods face challenges in striking the right…

Machine Learning · Computer Science 2024-10-29 Jianmina Ma , Jingtian Ji , Yue Gao

Asset Allocation: From Markowitz to Deep Reinforcement Learning

Asset allocation is an investment strategy that aims to balance risk and reward by constantly redistributing the portfolio's assets according to certain goals, risk tolerance, and investment horizon. Unfortunately, there is no simple…

Portfolio Management · Quantitative Finance 2022-08-16 Ricard Durall

Projection-Based Constrained Policy Optimization

We consider the problem of learning control policies that optimize a reward function while satisfying constraints due to considerations of safety, fairness, or other costs. We propose a new algorithm, Projection-Based Constrained Policy…

Machine Learning · Computer Science 2020-10-08 Tsung-Yen Yang , Justinian Rosca , Karthik Narasimhan , Peter J. Ramadge

A new perspective on classification: optimally allocating limited resources to uncertain tasks

A central problem in business concerns the optimal allocation of limited resources to a set of available tasks, where the payoff of these tasks is inherently uncertain. In credit card fraud detection, for instance, a bank can only assign a…

Machine Learning · Computer Science 2022-02-10 Toon Vanderschueren , Bart Baesens , Tim Verdonck , Wouter Verbeke

Harnessing Data from Clustered LQR Systems: Personalized and Collaborative Policy Optimization

It is known that reinforcement learning (RL) is data-hungry. To improve sample-efficiency of RL, it has been proposed that the learning algorithm utilize data from 'approximately similar' processes. However, since the process models are…

Machine Learning · Computer Science 2025-11-24 Vinay Kanakeri , Shivam Bajaj , Ashwin Verma , Vijay Gupta , Aritra Mitra

Optimal Resource Allocation with Semi-Bandit Feedback

We study a sequential resource allocation problem involving a fixed number of recurring jobs. At each time-step the manager should distribute available resources among the jobs in order to maximise the expected number of completed jobs.…

Machine Learning · Computer Science 2014-06-17 Tor Lattimore , Koby Crammer , Csaba Szepesvári

Autoregressive Policies for Continuous Control Deep Reinforcement Learning

Reinforcement learning algorithms rely on exploration to discover new behaviors, which is typically achieved by following a stochastic policy. In continuous control tasks, policies with a Gaussian distribution have been widely adopted.…

Machine Learning · Computer Science 2019-03-28 Dmytro Korenkevych , A. Rupam Mahmood , Gautham Vasan , James Bergstra

Bayesian Optimization for CVaR-based portfolio optimization

Optimal portfolio allocation is often formulated as a constrained risk problem, where one aims to minimize a risk measure subject to some performance constraints. This paper presents new Bayesian Optimization algorithms for such constrained…

Portfolio Management · Quantitative Finance 2025-03-25 Robert Millar , Jinglai Li

Learning Deterministic Policies with Policy Gradients in Constrained Markov Decision Processes

Constrained Reinforcement Learning (CRL) addresses sequential decision-making problems where agents are required to achieve goals by maximizing the expected return while meeting domain-specific constraints. In this setting, policy-based…

Machine Learning · Computer Science 2025-06-09 Alessandro Montenegro , Leonardo Cesani , Marco Mussi , Matteo Papini , Alberto Maria Metelli

Deep Reinforcement Learning for Resource Allocation in Business Processes

Assigning resources in business processes execution is a repetitive task that can be effectively automated. However, different automation methods may give varying results that may not be optimal. Proper resource allocation is crucial as it…

Machine Learning · Computer Science 2021-04-02 Kamil Żbikowski , Michał Ostapowicz , Piotr Gawrysiak

Constrained Policy Optimization

For many applications of reinforcement learning it can be more convenient to specify both a reward function and constraints, rather than trying to design behavior through the reward function. For example, systems that physically interact…

Machine Learning · Computer Science 2017-05-31 Joshua Achiam , David Held , Aviv Tamar , Pieter Abbeel

AlphaRank: An Artificial Intelligence Approach for Ranking and Selection Problems

We introduce AlphaRank, an artificial intelligence approach to address the fixed-budget ranking and selection (R&S) problems. We formulate the sequential sampling decision as a Markov decision process and propose a Monte Carlo…

Machine Learning · Computer Science 2024-02-05 Ruihan Zhou , L. Jeff Hong , Yijie Peng

Reinforcement Learning-assisted Constraint Relaxation for Constrained Expensive Optimization

Constraint handling plays a key role in solving realistic complex optimization problems. Though intensively discussed in the last few decades, existing constraint handling techniques predominantly rely on human experts' designs, which more…

Neural and Evolutionary Computing · Computer Science 2026-02-03 Qianhao Zhu , Sijie Ma , Zeyuan Ma , Hongshu Guo , Yue-Jiao Gong

Adaptive Learning for the Resource-Constrained Classification Problem

Resource-constrained classification tasks are common in real-world applications such as allocating tests for disease diagnosis, hiring decisions when filling a limited number of positions, and defect detection in manufacturing settings…

Machine Learning · Computer Science 2023-11-22 Danit Shifman Abukasis , Izack Cohen , Xiaochen Xian , Kejun Huang , Gonen Singer