Related papers: Sample-Efficient, Exploration-Based Policy Optimis…

Learning Vehicle Routing Problems using Policy Optimisation

Deep reinforcement learning (DRL) has been used to learn effective heuristics for solving complex combinatorial optimisation problem via policy networks and have demonstrated promising performance. Existing works have focused on solving…

Machine Learning · Computer Science 2020-12-25 Nasrin Sultana , Jeffrey Chan , A. K. Qin , Tabinda Sarwar

Reinforcement Learning for Solving the Vehicle Routing Problem

We present an end-to-end framework for solving the Vehicle Routing Problem (VRP) using reinforcement learning. In this approach, we train a single model that finds near-optimal solutions for problem instances sampled from a given…

Artificial Intelligence · Computer Science 2018-05-23 Mohammadreza Nazari , Afshin Oroojlooy , Lawrence V. Snyder , Martin Takáč

Combining Reinforcement Learning and Optimal Transport for the Traveling Salesman Problem

The traveling salesman problem is a fundamental combinatorial optimization problem with strong exact algorithms. However, as problems scale up, these exact algorithms fail to provide a solution in a reasonable time. To resolve this, current…

Machine Learning · Computer Science 2025-01-09 Yong Liang Goh , Wee Sun Lee , Xavier Bresson , Thomas Laurent , Nicholas Lim

Reinforcement Learning with Combinatorial Actions: An Application to Vehicle Routing

Value-function-based methods have long played an important role in reinforcement learning. However, finding the best next action given a value function of arbitrary complexity is nontrivial when the action space is too large for…

Machine Learning · Computer Science 2020-10-26 Arthur Delarue , Ross Anderson , Christian Tjandraatmadja

Reinforcement Learning for Solving Stochastic Vehicle Routing Problem with Time Windows

This paper introduces a reinforcement learning approach to optimize the Stochastic Vehicle Routing Problem with Time Windows (SVRP), focusing on reducing travel costs in goods delivery. We develop a novel SVRP formulation that accounts for…

Artificial Intelligence · Computer Science 2024-02-16 Zangir Iklassov , Ikboljon Sobirov , Ruben Solozabal , Martin Takac

Solve routing problems with a residual edge-graph attention neural network

For NP-hard combinatorial optimization problems, it is usually difficult to find high-quality solutions in polynomial time. The design of either an exact algorithm or an approximate algorithm for these problems often requires significantly…

Machine Learning · Computer Science 2021-05-07 Kun Lei , Peng Guo , Yi Wang , Xiao Wu , Wenchao Zhao

Constrained Combinatorial Optimization with Reinforcement Learning

This paper presents a framework to tackle constrained combinatorial optimization problems using deep Reinforcement Learning (RL). To this end, we extend the Neural Combinatorial Optimization (NCO) theory in order to deal with constraints in…

Machine Learning · Computer Science 2020-06-23 Ruben Solozabal , Josu Ceberio , Martin Takáč

Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization

Combinatorial optimization has found applications in numerous fields, from aerospace to transportation planning and economics. The goal is to find an optimal solution among a finite set of possibilities. The well-known challenge one faces…

Artificial Intelligence · Computer Science 2020-06-03 Quentin Cappart , Thierry Moisan , Louis-Martin Rousseau , Isabeau Prémont-Schwarz , Andre Cire

Learning Improvement Heuristics for Solving Routing Problems

Recent studies in using deep learning to solve routing problems focus on construction heuristics, the solutions of which are still far from optimality. Improvement heuristics have great potential to narrow this gap by iteratively refining a…

Artificial Intelligence · Computer Science 2020-05-12 Yaoxin Wu , Wen Song , Zhiguang Cao , Jie Zhang , Andrew Lim

Exploration via Planning for Information about the Optimal Trajectory

Many potential applications of reinforcement learning (RL) are stymied by the large numbers of samples required to learn an effective policy. This is especially true when applying RL to real-world control tasks, e.g. in the sciences or…

Machine Learning · Computer Science 2022-10-11 Viraj Mehta , Ian Char , Joseph Abbate , Rory Conlin , Mark D. Boyer , Stefano Ermon , Jeff Schneider , Willie Neiswanger

Reinforcement Learning for Solving Stochastic Vehicle Routing Problem

This study addresses a gap in the utilization of Reinforcement Learning (RL) and Machine Learning (ML) techniques in solving the Stochastic Vehicle Routing Problem (SVRP) that involves the challenging task of optimizing vehicle routes under…

Artificial Intelligence · Computer Science 2023-11-15 Zangir Iklassov , Ikboljon Sobirov , Ruben Solozabal , Martin Takac

Self-Improvement for Neural Combinatorial Optimization: Sample without Replacement, but Improvement

Current methods for end-to-end constructive neural combinatorial optimization usually train a policy using behavior cloning from expert solutions or policy gradient methods from reinforcement learning. While behavior cloning is…

Machine Learning · Computer Science 2024-11-05 Jonathan Pirnay , Dominik G. Grimm

Reinforcement Learning to Solve NP-hard Problems: an Application to the CVRP

In this paper, we evaluate the use of Reinforcement Learning (RL) to solve a classic combinatorial optimization problem: the Capacitated Vehicle Routing Problem (CVRP). We formalize this problem in the RL framework and compare two of the…

Artificial Intelligence · Computer Science 2022-01-17 Leo Ardon

Supervised Permutation Invariant Networks for Solving the CVRP with Bounded Fleet Size

Learning to solve combinatorial optimization problems, such as the vehicle routing problem, offers great computational advantages over classical operations research solvers and heuristics. The recently developed deep reinforcement learning…

Machine Learning · Computer Science 2022-01-06 Daniela Thyssens , Jonas Falkner , Lars Schmidt-Thieme

Efficient Sampling-Based Maximum Entropy Inverse Reinforcement Learning with Application to Autonomous Driving

In the past decades, we have witnessed significant progress in the domain of autonomous driving. Advanced techniques based on optimization and reinforcement learning (RL) become increasingly powerful at solving the forward problem: given…

Robotics · Computer Science 2020-06-25 Zheng Wu , Liting Sun , Wei Zhan , Chenyu Yang , Masayoshi Tomizuka

Learning to Sample: Reinforcement Learning-Guided Sampling for Autonomous Vehicle Motion Planning

Sampling-based motion planning is a well-established approach in autonomous driving, valued for its modularity and analytical tractability. In complex urban scenarios, however, uniform or heuristic sampling often produces many infeasible or…

Robotics · Computer Science 2026-03-24 Korbinian Moller , Roland Stroop , Mattia Piccinini , Alexander Langmann , Johannes Betz

SOLO: Search Online, Learn Offline for Combinatorial Optimization Problems

We study combinatorial problems with real world applications such as machine scheduling, routing, and assignment. We propose a method that combines Reinforcement Learning (RL) and planning. This method can equally be applied to both the…

Machine Learning · Computer Science 2021-05-19 Joel Oren , Chana Ross , Maksym Lefarov , Felix Richter , Ayal Taitler , Zohar Feldman , Christian Daniel , Dotan Di Castro

A Provably Efficient Sample Collection Strategy for Reinforcement Learning

One of the challenges in online reinforcement learning (RL) is that the agent needs to trade off the exploration of the environment and the exploitation of the samples to optimize its behavior. Whether we optimize for regret, sample…

Machine Learning · Computer Science 2021-11-19 Jean Tarbouriech , Matteo Pirotta , Michal Valko , Alessandro Lazaric

Reinforcement Learning in Real Option Models

We investigate an entropy-regularized reinforcement learning (RL) approach to optimal stopping problems motivated by real option models. Classical stopping rules are strict and non-randomized, limiting natural exploration in RL settings. To…

Optimization and Control · Mathematics 2026-02-18 Jodi Dianetti , Giorgio Ferrari , Renyuan Xu

Deep Reinforcement Learning for Combinatorial Optimization: Covering Salesman Problems

This paper introduces a new deep learning approach to approximately solve the Covering Salesman Problem (CSP). In this approach, given the city locations of a CSP as input, a deep neural network model is designed to directly output the…

Neural and Evolutionary Computing · Computer Science 2021-09-15 Kaiwen Li , Tao Zhang , Rui Wang Yuheng Wang , Yi Han