Related papers: Predictive Coding for Decision Transformer

200 papers

Recent research in offline reinforcement learning (RL) has demonstrated that return-conditioned supervised learning is a powerful paradigm for decision-making problems. While promising, return conditioning is limited to training data…

Machine Learning · Computer Science 2023-05-29 Zhihui Xie , Zichuan Lin , Deheng Ye , Qiang Fu , Wei Yang , Shuai Li

Recent works have shown that tackling offline reinforcement learning (RL) with a conditional policy produces promising results. The Decision Transformer (DT) combines the conditional policy approach and a transformer architecture, showing…

Machine Learning · Computer Science 2023-05-26 Taku Yamagata , Ahmed Khalil , Raul Santos-Rodriguez

Traditional approaches in offline reinforcement learning aim to learn the optimal policy that maximizes the cumulative reward, also known as return. It is increasingly important to adjust the performance of AI agents to meet human…

Machine Learning · Computer Science 2025-06-23 Tsunehiko Tanaka , Kenshi Abe , Kaito Ariu , Tetsuro Morimura , Edgar Simo-Serra

Decision Transformer (DT) is an innovative algorithm leveraging recent advances of the transformer architecture in reinforcement learning (RL). However, a notable limitation of DT is its reliance on recalling trajectories from datasets,…

Machine Learning · Computer Science 2023-11-02 Yi Ma , Chenjun Xiao , Hebin Liang , Jianye Hao

Decision Transformer (DT) has emerged as a promising class of algorithms in offline reinforcement learning (RL) tasks, leveraging pre-collected datasets and Transformer's capability to model long sequences. Recent works have demonstrated…

Machine Learning · Computer Science 2025-12-03 Yu Yang , Pan Xu

Closed-loop control of nonlinear dynamical systems with partial-state observability demands expert knowledge of a diverse, less standardized set of theoretical tools. Moreover, it requires a delicate integration of controller and estimator…

Systems and Control · Electrical Eng. & Systems 2024-04-04 Xiangyuan Zhang , Weichao Mao , Haoran Qiu , Tamer Başar

Decision Transformer (DT), which employs expressive sequence modeling techniques to perform action generation, has emerged as a promising approach to offline policy optimization. However, DT generates actions conditioned on a desired future…

Machine Learning · Computer Science 2024-06-25 Chen-Xiao Gao , Chenyang Wu , Mingjun Cao , Rui Kong , Zongzhang Zhang , Yang Yu

Recent work has demonstrated the effectiveness of formulating decision making as supervised learning on offline-collected trajectories. Powerful sequence models, such as GPT or BERT, are often employed to encode the trajectories. However,…

Machine Learning · Computer Science 2023-10-31 Zilai Zeng , Ce Zhang , Shijie Wang , Chen Sun

Despite the recent advancements in offline reinforcement learning via supervised learning (RvS) and the success of the decision transformer (DT) architecture in various domains, DTs have fallen short in several challenging benchmarks. The…

Machine Learning · Computer Science 2023-11-21 Anirudhan Badrinath , Yannis Flet-Berliac , Allen Nie , Emma Brunskill

Recent advancements in offline reinforcement learning (RL) have underscored the capabilities of Return-Conditioned Supervised Learning (RCSL), a paradigm that learns the action distribution based on target returns for each state in a…

Machine Learning · Computer Science 2023-12-22 Yuanfu Wang , Chao Yang , Ying Wen , Yu Liu , Yu Qiao

The field of Offline Reinforcement Learning (RL) aims to derive effective policies from pre-collected datasets without active environment interaction. While traditional offline RL algorithms like Conservative Q-Learning (CQL) and Implicit…

Machine Learning · Computer Science 2025-11-21 Ali Murtaza Caunhye , Asad Jeewa

Improving user retention with reinforcement learning (RL) has attracted increasing attention due to its significant importance in boosting user engagement. However, training the RL policy from scratch without hurting users' experience is…

Information Retrieval · Computer Science 2023-03-14 Kesen Zhao , Lixin Zou , Xiangyu Zhao , Maolin Wang , Dawei Yin

As the next generation of mobile systems evolves, artificial intelligence (AI) is expected to deeply integrate with wireless communications for resource management in variable environments. In particular, deep reinforcement learning (DRL)…

Signal Processing · Electrical Eng. & Systems 2024-10-15 Jie Zhang , Jun Li , Long Shi , Zhe Wang , Shi Jin , Wen Chen , H. Vincent Poor

Large transformer models trained on diverse datasets have shown a remarkable ability to learn in-context, achieving high few-shot performance on tasks they were not explicitly trained to solve. In this paper, we study the in-context…

Machine Learning · Computer Science 2023-06-27 Jonathan N. Lee , Annie Xie , Aldo Pacchiano , Yash Chandak , Chelsea Finn , Ofir Nachum , Emma Brunskill

Recent work has shown that offline reinforcement learning (RL) can be formulated as a sequence modeling problem (Chen et al., 2021; Janner et al., 2021) and solved via approaches similar to large-scale language modeling. However, any…

Machine Learning · Computer Science 2022-07-14 Qinqing Zheng , Amy Zhang , Aditya Grover

The recent success of Transformer in natural language processing has sparked its use in various domains. In offline reinforcement learning (RL), Decision Transformer (DT) is emerging as a promising model based on Transformer. However, we…

Machine Learning · Computer Science 2024-05-31 Jeonghye Kim , Suyoung Lee , Woojun Kim , Youngchul Sung

Advancements in reinforcement learning have led to the development of sophisticated models capable of learning complex decision-making tasks. However, efficiently integrating world models with decision transformers remains a challenge. In…

Offline safe reinforcement learning (RL) aims to train a constraint satisfaction policy from a fixed dataset. Current state-of-the-art approaches are based on supervised learning with a conditioned policy. However, these approaches fall…

Machine Learning · Computer Science 2025-01-28 Zijian Guo , Weichao Zhou , Wenchao Li

The Decision Transformer (DT) has established a powerful sequence modeling approach to offline reinforcement learning. It conditions its action predictions on Return-to-Go (RTG), using it both to distinguish trajectory quality during…

Artificial Intelligence · Computer Science 2026-01-23 Yongyi Wang , Hanyu Liu , Lingfeng Li , Bozhou Chen , Ang Li , Qirui Zheng , Xionghui Yang , Wenxin Li

Decision Transformer (DT) formulates offline reinforcement learning as autoregressive sequence modeling, achieving promising results by predicting actions from a sequence of Return-to-Go (RTG), state, and action tokens. However, RTG is a…

Machine Learning · Computer Science 2026-05-08 Yongyi Wang , Hanyu Liu , Lingfeng Li , Bozhou Chen , Ang Li , Qirui Zheng , Xionghui Yang , Chucai Wang , Wenxin Li