Related papers: Reinforcement Learning Driven Heuristic Optimizati…

Reinforcement Learning for Combinatorial Optimization: A Survey

Many traditional algorithms for solving combinatorial optimization problems involve using hand-crafted heuristics that sequentially construct a solution. Such heuristics are designed by domain experts and may often be suboptimal due to the…

Machine Learning · Computer Science 2020-12-25 Nina Mazyavkina, Sergey Sviridov, Sergei Ivanov, Evgeny Burnaev

Going Beyond Heuristics by Imposing Policy Improvement as a Constraint

In many reinforcement learning (RL) applications, augmenting the task rewards with heuristic rewards that encode human priors about how a task should be solved is crucial for achieving desirable performance. However, because such heuristics…

Machine Learning · Computer Science 2025-07-09 Chi-Chang Lee, Zhang-Wei Hong, Pulkit Agrawal

Reinforcement Learning as an Improvement Heuristic for Real-World Production Scheduling

The integration of Reinforcement Learning (RL) with heuristic methods is an emerging trend for solving optimization problems, which leverages RL's ability to learn from the data generated during the search process. One promising approach is…

Machine Learning · Computer Science 2024-09-19 Arthur Müller, Lukas Vollenkemper

Heuristic-Guided Reinforcement Learning

We provide a framework for accelerating reinforcement learning (RL) algorithms by heuristics constructed from domain knowledge or offline data. Tabula rasa RL algorithms require environment interactions or computation that scales with the…

Machine Learning · Computer Science 2021-11-23 Ching-An Cheng, Andrey Kolobov, Adith Swaminathan

Improved Reinforcement Learning Pushing Policies via Heuristic Rules

Non-prehensile pushing actions have the potential to singulate a target object from its surrounding clutter in order to facilitate the robotic grasping of the target. To address this problem we utilize a heuristic rule that moves the target…

Robotics · Computer Science 2022-09-27 Marios Kiatos, Iason Sarantopoulos, Sotiris Malassiotis, Zoe Doulgeri

POMO: Policy Optimization with Multiple Optima for Reinforcement Learning

In neural combinatorial optimization (CO), reinforcement learning (RL) can turn a deep neural net into a fast, powerful heuristic solver of NP-hard problems. This approach has a great potential in practical applications because it allows…

Machine Learning · Computer Science 2021-07-14 Yeong-Dae Kwon, Jinho Choo, Byoungjip Kim, Iljoo Yoon, Youngjune Gwon, Seungjai Min

Hybrid Reinforcement Learning Framework for Mixed-Variable Problems

Optimization problems characterized by both discrete and continuous variables are common across various disciplines, presenting unique challenges due to their complex solution landscapes and the difficulty of navigating mixed-variable…

Optimization and Control · Mathematics 2024-06-03 Haoyan Zhai, Qianli Hu, Jiangning Chen

Constrained Combinatorial Optimization with Reinforcement Learning

This paper presents a framework to tackle constrained combinatorial optimization problems using deep Reinforcement Learning (RL). To this end, we extend the Neural Combinatorial Optimization (NCO) theory in order to deal with constraints in…

Machine Learning · Computer Science 2020-06-23 Ruben Solozabal, Josu Ceberio, Martin Takáč

Heuristic Algorithm-based Action Masking Reinforcement Learning (HAAM-RL) with Ensemble Inference Method

This paper presents a novel reinforcement learning (RL) approach called HAAM-RL (Heuristic Algorithm-based Action Masking Reinforcement Learning) for optimizing the color batching re-sequencing problem in automobile painting processes. The…

Machine Learning · Computer Science 2024-03-22 Kyuwon Choi, Cheolkyun Rho, Taeyoun Kim, Daewoo Choi

Hindsight-Anchored Policy Optimization: Turning Failure into Feedback in Sparse Reward Settings

Reinforcement Learning with Verifiable Rewards (RLVR) has emerged as a promising paradigm for post-training reasoning models. However, group-based methods such as Group Relative Policy Optimization (GRPO) face a critical dilemma in…

Machine Learning · Computer Science 2026-04-07 Yuning Wu, Ke Wang, Devin Chen, Kai Wei

Preference Optimization for Combinatorial Optimization Problems

Reinforcement Learning (RL) has emerged as a powerful tool for neural combinatorial optimization, enabling models to learn heuristics that solve complex problems without requiring expert knowledge. Despite significant progress, existing RL…

Machine Learning · Computer Science 2025-05-14 Mingjun Pan, Guanquan Lin, You-Wei Luo, Bin Zhu, Zhien Dai, Lijun Sun, Chun Yuan

Heuristics for Combinatorial Optimization via Value-based Reinforcement Learning: A Unified Framework and Analysis

Since the 1990s, considerable empirical work has been carried out to train statistical models, such as neural networks (NNs), as learned heuristics for combinatorial optimization (CO) problems. When successful, such an approach eliminates…

Machine Learning · Statistics 2026-01-21 Orit Davidovich, Shimrit Shtern, Segev Wasserkrug, Nimrod Megiddo

Exploiting Symbolic Heuristics for the Synthesis of Domain-Specific Temporal Planning Guidance using Reinforcement Learning

Recent work investigated the use of Reinforcement Learning (RL) for the synthesis of heuristic guidance to improve the performance of temporal planners when a domain is fixed and a set of training problems (not plans) is given. The idea is…

Artificial Intelligence · Computer Science 2025-05-20 Irene Brugnara, Alessandro Valentini, Andrea Micheli

Towards Applicable Reinforcement Learning: Improving the Generalization and Sample Efficiency with Policy Ensemble

It is challenging for reinforcement learning (RL) algorithms to succeed in real-world applications like financial trading and logistic system due to the noisy observation and environment shifting between training and evaluation. Thus, it…

Machine Learning · Computer Science 2022-05-20 Zhengyu Yang, Kan Ren, Xufang Luo, Minghuan Liu, Weiqing Liu, Jiang Bian, Weinan Zhang, Dongsheng Li

Constrained Reinforcement Learning Under Model Mismatch

Existing studies on constrained reinforcement learning (RL) may obtain a well-performing policy in the training environment. However, when deployed in a real environment, it may easily violate constraints that were originally satisfied…

Machine Learning · Computer Science 2024-05-06 Zhongchang Sun, Sihong He, Fei Miao, Shaofeng Zou

Reinforcement Learning for Classical Planning: Viewing Heuristics as Dense Reward Generators

Recent advances in reinforcement learning (RL) have led to a growing interest in applying RL to classical planning domains or applying classical planning methods to some complex RL domains. However, the long-horizon goal-based problems…

Artificial Intelligence · Computer Science 2022-03-08 Clement Gehring, Masataro Asai, Rohan Chitnis, Tom Silver, Leslie Pack Kaelbling, Shirin Sohrabi, Michael Katz

IPO: Interior-point Policy Optimization under Constraints

In this paper, we study reinforcement learning (RL) algorithms to solve real-world decision problems with the objective of maximizing the long-term reward as well as satisfying cumulative constraints. We propose a novel first-order policy…

Machine Learning · Computer Science 2019-10-23 Yongshuai Liu, Jiaxin Ding, Xin Liu

Pass@K Policy Optimization: Solving Harder Reinforcement Learning Problems

Reinforcement Learning (RL) algorithms sample multiple n>1 solution attempts for each problem and reward them independently. This optimizes for pass@1 performance and prioritizes the strength of isolated samples at the expense of the…

Machine Learning · Computer Science 2025-12-16 Christian Walder, Deep Karkhanis

Data-Efficient Hierarchical Reinforcement Learning

Hierarchical reinforcement learning (HRL) is a promising approach to extend traditional reinforcement learning (RL) methods to solve more complex tasks. Yet, the majority of current HRL methods require careful task-specific design and…

Machine Learning · Computer Science 2018-10-08 Ofir Nachum, Shixiang Gu, Honglak Lee, Sergey Levine

Combinatorial Optimization with Policy Adaptation using Latent Space Search

Combinatorial Optimization underpins many real-world applications and yet, designing performant algorithms to solve these complex, typically NP-hard, problems remains a significant research challenge. Reinforcement Learning (RL) provides a…

Machine Learning · Computer Science 2024-05-29 Felix Chalumeau, Shikha Surana, Clement Bonnet, Nathan Grinsztajn, Arnu Pretorius, Alexandre Laterre, Thomas D. Barrett