Related papers: Graph Value Iteration

Reinforcement learning on graphs: A survey

Graph mining tasks arise in many application domains, ranging from social networks and transportation to e-commerce, and have received great attention from the theoretical and algorithmic design communities in recent…

Machine Learning · Computer Science 2023-01-18 Mingshuo Nie , Dongming Chen , Dongqi Wang

Reinforcement Learning Discovers Efficient Decentralized Graph Path Search Strategies

Graph path search is a classic computer science problem that has been recently approached with Reinforcement Learning (RL) due to its potential to outperform prior methods. Existing RL techniques typically assume a global view of the…

Machine Learning · Computer Science 2024-11-27 Alexei Pisacane , Victor-Alexandru Darvariu , Mirco Musolesi

Test-Time Graph Search for Goal-Conditioned Reinforcement Learning

Offline goal-conditioned reinforcement learning (GCRL) often struggles with long-horizon tasks, where errors in value estimation accumulate and produce unreliable policies. It is typically assumed that effective long-term planning is…

Machine Learning · Computer Science 2026-05-26 Evgenii Opryshko , Junwei Quan , Claas Voelcker , Yilun Du , Igor Gilitschenski

Graph neural induction of value iteration

Many reinforcement learning tasks can benefit from explicit planning based on an internal model of the environment. Previously, such planning components have been incorporated through a neural network that partially aligns with the…

Machine Learning · Computer Science 2020-09-29 Andreea Deac , Pierre-Luc Bacon , Jian Tang

Highway Graph to Accelerate Reinforcement Learning

Reinforcement Learning (RL) algorithms often struggle with low training efficiency. A common approach to address this challenge is integrating model-based planning algorithms, such as Monte Carlo Tree Search (MCTS) or Value Iteration (VI),…

Machine Learning · Computer Science 2025-01-08 Zidu Yin , Zhen Zhang , Dong Gong , Stefano V. Albrecht , Javen Q. Shi

Graph-GRPO: Training Graph Flow Models with Reinforcement Learning

Graph generation is a fundamental task with broad applications, such as drug discovery. Recently, discrete flow matching-based graph generation, a.k.a. graph flow model (GFM), has emerged due to its superior performance and flexible…

Machine Learning · Computer Science 2026-03-12 Baoheng Zhu , Deyu Bo , Delvin Ce Zhang , Xiao Wang

Reinforcement Learning Based Query Vertex Ordering Model for Subgraph Matching

Subgraph matching is a fundamental problem in various fields that use graph structured data. Subgraph matching algorithms enumerate all isomorphic embeddings of a query graph q in a data graph G. An important branch of matching algorithms…

Machine Learning · Computer Science 2022-04-01 Hanchen Wang , Ying Zhang , Lu Qin , Wei Wang , Wenjie Zhang , Xuemin Lin

Hierarchical Reinforcement Learning with Targeted Causal Interventions

Hierarchical reinforcement learning (HRL) improves the efficiency of long-horizon reinforcement-learning tasks with sparse rewards by decomposing the task into a hierarchy of subgoals. The main challenge of HRL is efficient discovery of the…

Machine Learning · Computer Science 2025-07-08 Sadegh Khorasani , Saber Salehkaleybar , Negar Kiyavash , Matthias Grossglauser

RLGT: A reinforcement learning framework for extremal graph theory

Reinforcement learning (RL) is a subfield of machine learning that focuses on developing models that can autonomously learn optimal decision-making strategies over time. In a recent pioneering paper, Wagner demonstrated how the Deep…

Machine Learning · Computer Science 2026-04-15 Ivan Damnjanović , Uroš Milivojević , Irena Đorđević , Dragan Stevanović

Reinforcement Learning with Intrinsically Motivated Feedback Graph for Lost-sales Inventory Control

Reinforcement learning (RL) has proven to perform well and to be general-purpose in inventory control (IC). However, further improvement of RL algorithms in the IC domain is impeded by two limitations of online experience. First,…

Machine Learning · Computer Science 2025-02-18 Zifan Liu , Xinran Li , Shibo Chen , Gen Li , Jiashuo Jiang , Jun Zhang

Goal Exploration Augmentation via Pre-trained Skills for Sparse-Reward Long-Horizon Goal-Conditioned Reinforcement Learning

Reinforcement learning (RL) often struggles to accomplish a sparse-reward long-horizon task in a complex environment. Goal-conditioned reinforcement learning (GCRL) has been employed to tackle this difficult problem via a curriculum of…

Machine Learning · Computer Science 2023-12-20 Lisheng Wu , Ke Chen

Reinforcement Learning to Rank Using Coarse-grained Rewards

Learning to rank (LTR) plays a crucial role in various Information Retrieval (IR) tasks. Although supervised LTR methods based on fine-grained relevance labels (e.g., document-level annotations) have achieved significant success, their…

Information Retrieval · Computer Science 2025-08-21 Yiteng Tu , Zhichao Xu , Tao Yang , Weihang Su , Yujia Zhou , Yiqun Liu , Fen Lin , Qin Liu , Qingyao Ai

Knowledge Graph Reasoning with Self-supervised Reinforcement Learning

Reinforcement learning (RL) is an effective method of finding reasoning pathways in incomplete knowledge graphs (KGs). To overcome the challenges of a large action space, a self-supervised pre-training method is proposed to warm up the…

Computation and Language · Computer Science 2025-04-17 Ying Ma , Owen Burns , Mingqiu Wang , Gang Li , Nan Du , Laurent El Shafey , Liqiang Wang , Izhak Shafran , Hagen Soltau

Causal Discovery with Reinforcement Learning

Discovering causal structure among a set of variables is a fundamental problem in many empirical sciences. Traditional score-based causal discovery methods rely on various local heuristics to search for a Directed Acyclic Graph (DAG)…

Machine Learning · Computer Science 2020-06-09 Shengyu Zhu , Ignavier Ng , Zhitang Chen

Searching for Plannable Domains can Speed up Reinforcement Learning

Reinforcement learning (RL) involves sequential decision making in uncertain environments. The aim of the decision-making agent is to maximize the benefit of acting in its environment over an extended period of time. Finding an optimal…

Artificial Intelligence · Computer Science 2007-05-23 Istvan Szita , Balint Takacs , Andras Lorincz

Learning Two-Step Hybrid Policy for Graph-Based Interpretable Reinforcement Learning

We present a two-step hybrid reinforcement learning (RL) policy that is designed to generate interpretable and robust hierarchical policies on the RL problem with graph-based input. Unlike prior deep reinforcement learning policies…

Machine Learning · Computer Science 2022-10-20 Tongzhou Mu , Kaixiang Lin , Feiyang Niu , Govind Thattai

Graph augmented Deep Reinforcement Learning in the GameRLand3D environment

We address planning and navigation in challenging 3D video games featuring maps with disconnected regions reachable by agents using special actions. In this setting, classical symbolic planners are not applicable or difficult to adapt. We…

Machine Learning · Computer Science 2021-12-23 Edward Beeching , Maxim Peter , Philippe Marcotte , Jilles Debangoye , Olivier Simonin , Joshua Romoff , Christian Wolf

What Can You Do When You Have Zero Rewards During RL?

Reinforcement learning (RL) with outcome-based rewards has proven effective for improving large language models (LLMs) on complex reasoning tasks. However, its success often depends on the base model occasionally sampling correct solutions.…

Machine Learning · Computer Science 2025-10-07 Jatin Prakash , Anirudh Buvanesh

Reinforcement Learning in Credit Scoring and Underwriting

This paper proposes a novel reinforcement learning (RL) framework for credit underwriting that tackles ungeneralizable contextual challenges. We adapt RL principles for credit scoring, incorporating action space renewal and multi-choice…

Machine Learning · Statistics 2024-06-28 Seksan Kiatsupaibul , Pakawan Chansiripas , Pojtanut Manopanjasiri , Kantapong Visantavarakul , Zheng Wen

Combining Subgoal Graphs with Reinforcement Learning to Build a Rational Pathfinder

In this paper, we present a hierarchical path planning framework called SG-RL (subgoal graphs-reinforcement learning), to plan rational paths for agents maneuvering in continuous and uncertain environments. By "rational", we mean (1)…

Artificial Intelligence · Computer Science 2019-04-05 Junjie Zeng , Long Qin , Yue Hu , Cong Hu , Quanjun Yin