Related papers: Predictive Coding for Decision Transformer

Future-conditioned Unsupervised Pretraining for Decision Transformer

Recent research in offline reinforcement learning (RL) has demonstrated that return-conditioned supervised learning is a powerful paradigm for decision-making problems. While promising, return conditioning is limited to training data…

Machine Learning · Computer Science 2023-05-29 Zhihui Xie, Zichuan Lin, Deheng Ye, Qiang Fu, Wei Yang, Shuai Li

Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL

Recent works have shown that tackling offline reinforcement learning (RL) with a conditional policy produces promising results. The Decision Transformer (DT) combines the conditional policy approach and a transformer architecture, showing…

Machine Learning · Computer Science 2023-05-26 Taku Yamagata, Ahmed Khalil, Raul Santos-Rodriguez

Return-Aligned Decision Transformer

Traditional approaches in offline reinforcement learning aim to learn the optimal policy that maximizes the cumulative reward, also known as return. It is increasingly important to adjust the performance of AI agents to meet human…

Machine Learning · Computer Science 2025-06-23 Tsunehiko Tanaka, Kenshi Abe, Kaito Ariu, Tetsuro Morimura, Edgar Simo-Serra

Rethinking Decision Transformer via Hierarchical Reinforcement Learning

Decision Transformer (DT) is an innovative algorithm leveraging recent advances of the transformer architecture in reinforcement learning (RL). However, a notable limitation of DT is its reliance on recalling trajectories from datasets,…

Machine Learning · Computer Science 2023-11-02 Yi Ma, Chenjun Xiao, Hebin Liang, Jianye Hao

Pre-trained Language Models Improve the Few-shot Prompt Ability of Decision Transformer

Decision Transformer (DT) has emerged as a promising class of algorithms in offline reinforcement learning (RL) tasks, leveraging pre-collected datasets and Transformer's capability to model long sequences. Recent works have demonstrated…

Machine Learning · Computer Science 2025-12-03 Yu Yang, Pan Xu

Decision Transformer as a Foundation Model for Partially Observable Continuous Control

Closed-loop control of nonlinear dynamical systems with partial-state observability demands expert knowledge of a diverse, less standardized set of theoretical tools. Moreover, it requires a delicate integration of controller and estimator…

Systems and Control · Electrical Eng. & Systems 2024-04-04 Xiangyuan Zhang, Weichao Mao, Haoran Qiu, Tamer Başar

ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning

Decision Transformer (DT), which employs expressive sequence modeling techniques to perform action generation, has emerged as a promising approach to offline policy optimization. However, DT generates actions conditioned on a desired future…

Machine Learning · Computer Science 2024-06-25 Chen-Xiao Gao, Chenyang Wu, Mingjun Cao, Rui Kong, Zongzhang Zhang, Yang Yu

Goal-Conditioned Predictive Coding for Offline Reinforcement Learning

Recent work has demonstrated the effectiveness of formulating decision making as supervised learning on offline-collected trajectories. Powerful sequence models, such as GPT or BERT, are often employed to encode the trajectories. However,…

Machine Learning · Computer Science 2023-10-31 Zilai Zeng, Ce Zhang, Shijie Wang, Chen Sun

Waypoint Transformer: Reinforcement Learning via Supervised Learning with Intermediate Targets

Despite the recent advancements in offline reinforcement learning via supervised learning (RvS) and the success of the decision transformer (DT) architecture in various domains, DTs have fallen short in several challenging benchmarks. The…

Machine Learning · Computer Science 2023-11-21 Anirudhan Badrinath, Yannis Flet-Berliac, Allen Nie, Emma Brunskill

Critic-Guided Decision Transformer for Offline Reinforcement Learning

Recent advancements in offline reinforcement learning (RL) have underscored the capabilities of Return-Conditioned Supervised Learning (RCSL), a paradigm that learns the action distribution based on target returns for each state in a…

Machine Learning · Computer Science 2023-12-22 Yuanfu Wang, Chao Yang, Ying Wen, Yu Liu, Yu Qiao

A Comparison Between Decision Transformers and Traditional Offline Reinforcement Learning Algorithms

The field of Offline Reinforcement Learning (RL) aims to derive effective policies from pre-collected datasets without active environment interaction. While traditional offline RL algorithms like Conservative Q-Learning (CQL) and Implicit…

Machine Learning · Computer Science 2025-11-21 Ali Murtaza Caunhye, Asad Jeewa

User Retention-oriented Recommendation with Decision Transformer

Improving user retention with reinforcement learning (RL) has attracted increasing attention due to its significant importance in boosting user engagement. However, training the RL policy from scratch without hurting users' experience is…

Information Retrieval · Computer Science 2023-03-14 Kesen Zhao, Lixin Zou, Xiangyu Zhao, Maolin Wang, Dawei Yin

Decision Transformers for Wireless Communications: A New Paradigm of Resource Management

As the next generation of mobile systems evolves, artificial intelligence (AI) is expected to deeply integrate with wireless communications for resource management in variable environments. In particular, deep reinforcement learning (DRL)…

Signal Processing · Electrical Eng. & Systems 2024-10-15 Jie Zhang, Jun Li, Long Shi, Zhe Wang, Shi Jin, Wen Chen, H. Vincent Poor

Supervised Pretraining Can Learn In-Context Reinforcement Learning

Large transformer models trained on diverse datasets have shown a remarkable ability to learn in-context, achieving high few-shot performance on tasks they were not explicitly trained to solve. In this paper, we study the in-context…

Machine Learning · Computer Science 2023-06-27 Jonathan N. Lee, Annie Xie, Aldo Pacchiano, Yash Chandak, Chelsea Finn, Ofir Nachum, Emma Brunskill

Online Decision Transformer

Recent work has shown that offline reinforcement learning (RL) can be formulated as a sequence modeling problem (Chen et al., 2021; Janner et al., 2021) and solved via approaches similar to large-scale language modeling. However, any…

Machine Learning · Computer Science 2022-07-14 Qinqing Zheng, Amy Zhang, Aditya Grover

Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making

The recent success of Transformer in natural language processing has sparked its use in various domains. In offline reinforcement learning (RL), Decision Transformer (DT) is emerging as a promising model based on Transformer. However, we…

Machine Learning · Computer Science 2024-05-31 Jeonghye Kim, Suyoung Lee, Woojun Kim, Youngchul Sung

DODT: Enhanced Online Decision Transformer Learning through Dreamer's Actor-Critic Trajectory Forecasting

Advancements in reinforcement learning have led to the development of sophisticated models capable of learning complex decision-making tasks. However, efficiently integrating world models with decision transformers remains a challenge. In…

Machine Learning · Computer Science 2024-10-16 Eric Hanchen Jiang, Zhi Zhang, Dinghuai Zhang, Andrew Lizarraga, Chenheng Xu, Yasi Zhang, Siyan Zhao, Zhengjie Xu, Peiyu Yu, Yuer Tang, Deqian Kong, Ying Nian Wu

Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement Learning

Offline safe reinforcement learning (RL) aims to train a constraint satisfaction policy from a fixed dataset. Current state-of-the-art approaches are based on supervised learning with a conditioned policy. However, these approaches fall…

Machine Learning · Computer Science 2025-01-28 Zijian Guo, Weichao Zhou, Wenchao Li

Decoupling Return-to-Go for Efficient Decision Transformer

The Decision Transformer (DT) has established a powerful sequence modeling approach to offline reinforcement learning. It conditions its action predictions on Return-to-Go (RTG), using it both to distinguish trajectory quality during…

Artificial Intelligence · Computer Science 2026-01-23 Yongyi Wang, Hanyu Liu, Lingfeng Li, Bozhou Chen, Ang Li, Qirui Zheng, Xionghui Yang, Wenxin Li

Beyond Autoregressive RTG: Conditioning via Injection Outside Sequential Modeling in Decision Transformer

Decision Transformer (DT) formulates offline reinforcement learning as autoregressive sequence modeling, achieving promising results by predicting actions from a sequence of Return-to-Go (RTG), state, and action tokens. However, RTG is a…

Machine Learning · Computer Science 2026-05-08 Yongyi Wang, Hanyu Liu, Lingfeng Li, Bozhou Chen, Ang Li, Qirui Zheng, Xionghui Yang, Chucai Wang, Wenxin Li