Related papers: Generative Sequential Notification Optimization vi…

A Comparison Between Decision Transformers and Traditional Offline Reinforcement Learning Algorithms

The field of Offline Reinforcement Learning (RL) aims to derive effective policies from pre-collected datasets without active environment interaction. While traditional offline RL algorithms like Conservative Q-Learning (CQL) and Implicit…

Machine Learning · Computer Science 2025-11-21 Ali Murtaza Caunhye , Asad Jeewa

Rethinking Decision Transformer via Hierarchical Reinforcement Learning

Decision Transformer (DT) is an innovative algorithm leveraging recent advances of the transformer architecture in reinforcement learning (RL). However, a notable limitation of DT is its reliance on recalling trajectories from datasets,…

Machine Learning · Computer Science 2023-11-02 Yi Ma , Chenjun Xiao , Hebin Liang , Jianye Hao

Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL

Recent works have shown that tackling offline reinforcement learning (RL) with a conditional policy produces promising results. The Decision Transformer (DT) combines the conditional policy approach and a transformer architecture, showing…

Machine Learning · Computer Science 2023-05-26 Taku Yamagata , Ahmed Khalil , Raul Santos-Rodriguez

Multi-objective Optimization of Notifications Using Offline Reinforcement Learning

Mobile notification systems play a major role in a variety of applications to communicate, send alerts and reminders to the users to inform them about news, events or messages. In this paper, we formulate the near-real-time notification…

Machine Learning · Computer Science 2022-07-08 Prakruthi Prabhakar , Yiping Yuan , Guangyu Yang , Wensheng Sun , Ajith Muralidharan

User Retention-oriented Recommendation with Decision Transformer

Improving user retention with reinforcement learning~(RL) has attracted increasing attention due to its significant importance in boosting user engagement. However, training the RL policy from scratch without hurting users' experience is…

Information Retrieval · Computer Science 2023-03-14 Kesen Zhao , Lixin Zou , Xiangyu Zhao , Maolin Wang , Dawei yin

DRDT3: Diffusion-Refined Decision Test-Time Training Model

Decision Transformer (DT), a trajectory modelling method, has shown competitive performance compared to traditional offline reinforcement learning (RL) approaches on various classic control tasks. However, it struggles to learn optimal…

Machine Learning · Computer Science 2025-09-18 Xingshuai Huang , Di Wu , Benoit Boulet

Decision Transformers for Wireless Communications: A New Paradigm of Resource Management

As the next generation of mobile systems evolves, artificial intelligence (AI) is expected to deeply integrate with wireless communications for resource management in variable environments. In particular, deep reinforcement learning (DRL)…

Signal Processing · Electrical Eng. & Systems 2024-10-15 Jie Zhang , Jun Li , Long Shi , Zhe Wang , Shi Jin , Wen Chen , H. Vincent Poor

Offline Reinforcement Learning for Mobile Notifications

Mobile notification systems have taken a major role in driving and maintaining user engagement for online platforms. They are interesting recommender systems to machine learning practitioners with more sequential and long-term feedback…

Machine Learning · Computer Science 2022-02-09 Yiping Yuan , Ajith Muralidharan , Preetam Nandy , Miao Cheng , Prakruthi Prabhakar

When should we prefer Decision Transformers for Offline Reinforcement Learning?

Offline reinforcement learning (RL) allows agents to learn effective, return-maximizing policies from a static dataset. Three popular algorithms for offline RL are Conservative Q-Learning (CQL), Behavior Cloning (BC), and Decision…

Machine Learning · Computer Science 2024-03-13 Prajjwal Bhargava , Rohan Chitnis , Alborz Geramifard , Shagun Sodhani , Amy Zhang

Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement Learning

Offline safe reinforcement learning (RL) aims to train a constraint satisfaction policy from a fixed dataset. Current state-of-the-art approaches are based on supervised learning with a conditioned policy. However, these approaches fall…

Machine Learning · Computer Science 2025-01-28 Zijian Guo , Weichao Zhou , Wenchao Li

Adjusting the Output of Decision Transformer with Action Gradient

Decision Transformer (DT), which integrates reinforcement learning (RL) with the transformer model, introduces a novel approach to offline RL. Unlike classical algorithms that take maximizing cumulative discounted rewards as objective, DT…

Machine Learning · Computer Science 2025-10-08 Rui Lin , Yiwen Zhang , Zhicheng Peng , Minghao Lyu

GNN-DT: Graph Neural Network Enhanced Decision Transformer for Efficient Optimization in Dynamic Environments

Reinforcement Learning (RL) methods used for solving real-world optimization problems often involve dynamic state-action spaces, larger scale, and sparse rewards, leading to significant challenges in convergence, scalability, and efficient…

Machine Learning · Computer Science 2025-09-29 Stavros Orfanoudakis , Nanda Kishor Panda , Peter Palensky , Pedro P. Vergara

Online Decision Transformer

Recent work has shown that offline reinforcement learning (RL) can be formulated as a sequence modeling problem (Chen et al., 2021; Janner et al., 2021) and solved via approaches similar to large-scale language modeling. However, any…

Machine Learning · Computer Science 2022-07-14 Qinqing Zheng , Amy Zhang , Aditya Grover

Decoupling Return-to-Go for Efficient Decision Transformer

The Decision Transformer (DT) has established a powerful sequence modeling approach to offline reinforcement learning. It conditions its action predictions on Return-to-Go (RTG), using it both to distinguish trajectory quality during…

Artificial Intelligence · Computer Science 2026-01-23 Yongyi Wang , Hanyu Liu , Lingfeng Li , Bozhou Chen , Ang Li , Qirui Zheng , Xionghui Yang , Wenxin Li

In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought

In-context learning is a promising approach for offline reinforcement learning (RL) to handle online tasks, which can be achieved by providing task prompts. Recent works demonstrated that in-context RL could emerge with self-improvement in…

Machine Learning · Computer Science 2024-06-03 Sili Huang , Jifeng Hu , Hechang Chen , Lichao Sun , Bo Yang

Quantum Decision Transformers (QDT): Synergistic Entanglement and Interference for Offline Reinforcement Learning

Offline reinforcement learning enables policy learning from pre-collected datasets without environment interaction, but existing Decision Transformer (DT) architectures struggle with long-horizon credit assignment and complex state-action…

Machine Learning · Computer Science 2025-12-18 Abraham Itzhak Weinberg

Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning

As a data-driven paradigm, offline reinforcement learning (Offline RL) has been formulated as sequence modeling, where the Decision Transformer (DT) has demonstrated exceptional capabilities. Unlike previous reinforcement learning methods…

Machine Learning · Computer Science 2024-09-13 Teng Yan , Zhendong Ruan , Yaobang Cai , Yu Han , Wenxian Li , Yang Zhang

Enhancing Decision Transformer with Diffusion-Based Trajectory Branch Generation

Decision Transformer (DT) can learn effective policy from offline datasets by converting the offline reinforcement learning (RL) into a supervised sequence modeling task, where the trajectory elements are generated auto-regressively…

Machine Learning · Computer Science 2024-11-19 Zhihong Liu , Long Qian , Zeyang Liu , Lipeng Wan , Xingyu Chen , Xuguang Lan

Predictive Coding for Decision Transformer

Recent work in offline reinforcement learning (RL) has demonstrated the effectiveness of formulating decision-making as return-conditioned supervised learning. Notably, the decision transformer (DT) architecture has shown promise across…

Machine Learning · Computer Science 2025-04-04 Tung M. Luu , Donghoon Lee , Chang D. Yoo

Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning

We study offline off-dynamics reinforcement learning (RL) to utilize data from an easily accessible source domain to enhance policy learning in a target domain with limited data. Our approach centers on return-conditioned supervised…

Machine Learning · Computer Science 2026-03-03 Ruhan Wang , Yu Yang , Zhishuai Liu , Dongruo Zhou , Pan Xu