Related papers: Pointer Networks with Q-Learning for Combinatorial…

Hybrid Pointer Networks for Traveling Salesman Problems Optimization

In this work, a novel idea is presented for combinatorial optimization problems, a hybrid network, which results in a superior outcome. We applied this method to graph pointer networks [1], expanding its capabilities to a higher level. We…

Machine Learning · Computer Science 2022-01-19 Ahmed Stohy, Heba-Tullah Abdelhakam, Sayed Ali, Mohammed Elhenawy, Abdallah A Hassan, Mahmoud Masoud, Sebastien Glaser, Andry Rakotonirainy

Preference-Based Gradient Estimation for ML-Guided Approximate Combinatorial Optimization

Combinatorial optimization (CO) problems arise across a broad spectrum of domains, including medicine, logistics, and manufacturing. While exact solutions are often computationally infeasible, many practical applications require…

Machine Learning · Computer Science 2025-05-27 Arman Mielke, Uwe Bauknecht, Thilo Strauss, Mathias Niepert

QADQN: Quantum Attention Deep Q-Network for Financial Market Prediction

Financial market prediction and optimal trading strategy development remain challenging due to market complexity and volatility. Our research in quantum finance and reinforcement learning for decision-making demonstrates the approach of…

Quantum Physics · Physics 2025-01-24 Siddhant Dutta, Nouhaila Innan, Alberto Marchisio, Sadok Ben Yahia, Muhammad Shafique

Combinatorial Optimization by Graph Pointer Networks and Hierarchical Reinforcement Learning

In this work, we introduce Graph Pointer Networks (GPNs) trained using reinforcement learning (RL) for tackling the traveling salesman problem (TSP). GPNs build upon Pointer Networks by introducing a graph embedding layer on the input,…

Machine Learning · Computer Science 2019-11-13 Qiang Ma, Suwen Ge, Danyang He, Darshan Thaker, Iddo Drori

Pointer Networks

We introduce a new neural architecture to learn the conditional probability of an output sequence with elements that are discrete tokens corresponding to positions in an input sequence. Such problems cannot be trivially addressed by…

Machine Learning · Statistics 2017-01-03 Oriol Vinyals, Meire Fortunato, Navdeep Jaitly

A Reinforcement Learning Approach to the Orienteering Problem with Time Windows

The Orienteering Problem with Time Windows (OPTW) is a combinatorial optimization problem where the goal is to maximize the total score collected from different visited locations. The application of neural network models to combinatorial…

Machine Learning · Computer Science 2021-07-01 Ricardo Gama, Hugo L. Fernandes

A Deep Reinforcement Learning Approach for Solving the Traveling Salesman Problem with Drone

Reinforcement learning has recently shown promise in learning quality solutions in many combinatorial optimization problems. In particular, the attention-based encoder-decoder models show high effectiveness on various routing problems,…

Optimization and Control · Mathematics 2022-12-06 Aigerim Bogyrbayeva, Taehyun Yoon, Hanbum Ko, Sungbin Lim, Hyokun Yun, Changhyun Kwon

Periodic Q-Learning

The use of target networks is a common practice in deep reinforcement learning for stabilizing the training; however, theoretical understanding of this technique is still limited. In this paper, we study the so-called periodic Q-learning…

Machine Learning · Computer Science 2020-02-25 Donghwan Lee, Niao He

Bridging the gap between QP-based and MPC-based RL

Reinforcement learning methods typically use Deep Neural Networks to approximate the value functions and policies underlying a Markov Decision Process. Unfortunately, DNN-based RL suffers from a lack of explainability of the resulting…

Systems and Control · Electrical Eng. & Systems 2022-05-19 Shambhuraj Sawant, Sebastien Gros

Solving the QAP by Two-Stage Graph Pointer Networks and Reinforcement Learning

Quadratic Assignment Problem (QAP) is a practical combinatorial optimization problem that has been studied for several years. Since it is NP-hard, solving large problem instances of QAP is challenging. Although heuristics can find…

Machine Learning · Computer Science 2024-04-02 Satoko Iida, Ryota Yasudo

TreeDQN: Sample-Efficient Off-Policy Reinforcement Learning for Combinatorial Optimization

A convenient approach to optimally solving combinatorial optimization tasks is the Branch-and-Bound method. Its branching heuristic can be learned to solve a large set of similar tasks. The promising results here are achieved by the…

Machine Learning · Computer Science 2026-05-22 D. Sorokin, A. Kostin, L. Savchenko, G. Gusev, A. V. Savchenko

Deep Q-Network (DQN) multi-agent reinforcement learning (MARL) for Stock Trading

This project addresses the challenge of automated stock trading, where traditional methods and direct reinforcement learning (RL) struggle with market noise, complexity, and generalization. Our proposed solution is an integrated deep…

Machine Learning · Computer Science 2025-05-08 John Christopher Tidwell, John Storm Tidwell

Deep Reinforcement Learning for Dynamic Multichannel Access in Wireless Networks

We consider a dynamic multichannel access problem, where multiple correlated channels follow an unknown joint Markov model. A user at each time slot selects a channel to transmit data and receives a reward based on the success or failure of…

Networking and Internet Architecture · Computer Science 2018-02-21 Shangxing Wang, Hanpeng Liu, Pedro Henrique Gomes, Bhaskar Krishnamachari

Deep Learning for Optimization of Trajectories for Quadrotors

This paper presents a novel learning-based trajectory planning framework for quadrotors that combines model-based optimization techniques with deep learning. Specifically, we formulate the trajectory optimization problem as a quadratic…

Robotics · Computer Science 2023-12-05 Yuwei Wu, Xiatao Sun, Igor Spasojevic, Vijay Kumar

$QD$-Learning: A Collaborative Distributed Strategy for Multi-Agent Reinforcement Learning Through Consensus + Innovations

The paper considers a class of multi-agent Markov decision processes (MDPs), in which the network agents respond differently (as manifested by the instantaneous one-stage random costs) to a global controlled state and the control actions of…

Machine Learning · Statistics 2015-06-04 Soummya Kar, José M. F. Moura, H. Vincent Poor

RELS-DQN: A Robust and Efficient Local Search Framework for Combinatorial Optimization

Combinatorial optimization (CO) aims to efficiently find the best solution to NP-hard problems ranging from statistical physics to social media marketing. A wide range of CO applications can benefit from local search methods because they…

Machine Learning · Computer Science 2023-04-14 Yuanhang Shao, Tonmoy Dey, Nikola Vuckovic, Luke Van Popering, Alan Kuhnle

Vehicle Routing Problems via Quantum Graph Attention Network Deep Reinforcement Learning

The vehicle routing problem (VRP) is a fundamental NP-hard task in intelligent transportation systems with broad applications in logistics and distribution. Deep reinforcement learning (DRL) with Graph Neural Networks (GNNs) has shown…

Machine Learning · Computer Science 2025-11-20 Le Tung Giang, Vu Hoang Viet, Nguyen Xuan Tung, Trinh Van Chien, Won-Joo Hwang

Multi-objective Pointer Network for Combinatorial Optimization

Multi-objective combinatorial optimization problems (MOCOPs), one type of complex optimization problems, widely exist in various real applications. Although meta-heuristics have been successfully applied to address MOCOPs, the calculation…

Machine Learning · Computer Science 2022-04-27 Le-yang Gao, Rui Wang, Chuang Liu, Zhao-hong Jia

APQ: Joint Search for Network Architecture, Pruning and Quantization Policy

We present APQ for efficient deep learning inference on resource-constrained hardware. Unlike previous methods that separately search the neural architecture, pruning policy, and quantization policy, we optimize them in a joint manner. To…

Machine Learning · Computer Science 2020-06-16 Tianzhe Wang, Kuan Wang, Han Cai, Ji Lin, Zhijian Liu, Song Han

Attention, Learn to Solve Routing Problems!

The recently presented idea to learn heuristics for combinatorial optimization problems is promising as it can save costly development. However, to push this idea towards practical implementation, we need better models and better ways of…

Machine Learning · Statistics 2019-02-08 Wouter Kool, Herke van Hoof, Max Welling