Related papers: Generalized Data Distribution Iteration

GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning

Deep Q Network (DQN) firstly kicked the door of deep reinforcement learning (DRL) via combining deep learning (DL) with reinforcement learning (RL), which has noticed that the distribution of the acquired data would change during the…

Machine Learning · Computer Science 2022-01-11 Jiajun Fan, Changnan Xiao, Yue Huang

A Review for Deep Reinforcement Learning in Atari: Benchmarks, Challenges, and Solutions

The Arcade Learning Environment (ALE) is proposed as an evaluation platform for empirically assessing the generality of agents across dozens of Atari 2600 games. ALE offers various challenging problems and has drawn significant attention…

Artificial Intelligence · Computer Science 2023-02-28 Jiajun Fan

Overcoming Overfitting in Reinforcement Learning via Gaussian Process Diffusion Policy

One of the key challenges that Reinforcement Learning (RL) faces is its limited capability to adapt to a change of data distribution caused by uncertainties. This challenge arises especially in RL systems using deep neural networks as…

Machine Learning · Computer Science 2025-06-17 Amornyos Horprasert, Esa Apriaskar, Xingyu Liu, Lanlan Su, Lyudmila S. Mihaylova

Generative AI for Deep Reinforcement Learning: Framework, Analysis, and Use Cases

As a form of artificial intelligence (AI) technology based on interactive learning, deep reinforcement learning (DRL) has been widely applied across various fields and has achieved remarkable accomplishments. However, DRL faces certain…

Machine Learning · Computer Science 2025-02-18 Geng Sun, Wenwen Xie, Dusit Niyato, Fang Mei, Jiawen Kang, Hongyang Du, Shiwen Mao

Pretraining Representations for Data-Efficient Reinforcement Learning

Data efficiency is a key challenge for deep reinforcement learning. We address this problem by using unlabeled data to pretrain an encoder which is then finetuned on a small amount of task-specific data. To encourage learning…

Machine Learning · Computer Science 2021-06-10 Max Schwarzer, Nitarshan Rajkumar, Michael Noukhovitch, Ankesh Anand, Laurent Charlin, Devon Hjelm, Philip Bachman, Aaron Courville

Statistics and Samples in Distributional Reinforcement Learning

We present a unifying framework for designing and analysing distributional reinforcement learning (DRL) algorithms in terms of recursively estimating statistics of the return distribution. Our key insight is that DRL algorithms can be…

Machine Learning · Statistics 2019-02-22 Mark Rowland, Robert Dadashi, Saurabh Kumar, Rémi Munos, Marc G. Bellemare, Will Dabney

Generalization in Reinforcement Learning for Radio Access Networks

Modern RAN operate in highly dynamic and heterogeneous environments, where hand-tuned, rule-based RRM algorithms often underperform. While RL can surpass such heuristics in constrained settings, the diversity of deployments and…

Machine Learning · Computer Science 2026-01-29 Burak Demirel, Yu Wang, Cristian Tatino, Pablo Soldati

Is Deep Reinforcement Learning Really Superhuman on Atari? Leveling the playing field

Consistent and reproducible evaluation of Deep Reinforcement Learning (DRL) is not straightforward. In the Arcade Learning Environment (ALE), small changes in environment parameters such as stochasticity or the maximum allowed play time can…

Artificial Intelligence · Computer Science 2019-11-11 Marin Toromanoff, Emilie Wirbel, Fabien Moutarde

A Multi-Agent Deep Reinforcement Learning Approach for RAN Resource Allocation in O-RAN

Artificial intelligence (AI) and Machine Learning (ML) are considered as key enablers for realizing the full potential of fifth-generation (5G) and beyond mobile networks, particularly in the context of resource management and…

Networking and Internet Architecture · Computer Science 2023-07-06 Farhad Rezazadeh, Lanfranco Zanzi, Francesco Devoti, Sergio Barrachina-Munoz, Engin Zeydan, Xavier Costa-Pérez, Josep Mangues-Bafalluy

Active Reinforcement Learning over MDPs

The past decade has seen the rapid development of Reinforcement Learning, which acquires impressive performance with numerous training resources. However, one of the greatest challenges in RL is generalization efficiency (i.e.,…

Machine Learning · Computer Science 2021-08-18 Qi Yang, Peng Yang, Ke Tang

Distributed Resource Scheduling for Large-Scale MEC Systems: A Multi-Agent Ensemble Deep Reinforcement Learning with Imitation Acceleration

We consider the optimization of distributed resource scheduling to minimize the sum of task latency and energy consumption for all the Internet of things devices (IoTDs) in a large-scale mobile edge computing (MEC) system. To address this…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-05-27 Feibo Jiang, Li Dong, Kezhi Wang, Kun Yang, Cunhua Pan

A Deep Reinforcement Learning Approach to Efficient Distributed Optimization

In distributed optimization, the practical problem-solving performance is essentially sensitive to algorithm selection, parameter setting, problem type and data pattern. Thus, it is often laborious to acquire a highly efficient method for a…

Optimization and Control · Mathematics 2024-01-04 Daokuan Zhu, Tianqi Xu, Jie Lu

Agent57: Outperforming the Atari Human Benchmark

Atari games have been a long-standing benchmark in the reinforcement learning (RL) community for the past decade. This benchmark was proposed to test general competency of RL algorithms. Previous work has achieved good average performance…

Machine Learning · Computer Science 2020-03-31 Adrià Puigdomènech Badia, Bilal Piot, Steven Kapturowski, Pablo Sprechmann, Alex Vitvitskyi, Daniel Guo, Charles Blundell

Deep Reinforcement Learning for Swarm Systems

Recently, deep reinforcement learning (RL) methods have been applied successfully to multi-agent scenarios. Typically, these methods rely on a concatenation of agent states to represent the information content required for decentralized…

Multiagent Systems · Computer Science 2019-06-07 Maximilian Hüttenrauch, Adrian Šošić, Gerhard Neumann

GenAI-based Multi-Agent Reinforcement Learning towards Distributed Agent Intelligence: A Generative-RL Agent Perspective

Multi-agent reinforcement learning faces fundamental challenges that conventional approaches have failed to overcome: exponentially growing joint action spaces, non-stationary environments where simultaneous learning creates moving targets,…

Artificial Intelligence · Computer Science 2025-07-15 Hang Wang, Junshan Zhang

Distributed Deep Reinforcement Learning: An Overview

Deep reinforcement learning (DRL) is a very active research area. However, several technical and scientific issues require to be addressed, amongst which we can mention data inefficiency, exploration-exploitation trade-off, and multi-task…

Machine Learning · Computer Science 2020-11-24 Mohammad Reza Samsami, Hossein Alimadad

Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain

Deep Reinforcement Learning (DRL) and Deep Multi-agent Reinforcement Learning (MARL) have achieved significant successes across a wide range of domains, including game AI, autonomous vehicles, robotics, and so on. However, DRL and deep MARL…

Artificial Intelligence · Computer Science 2023-02-03 Jianye Hao, Tianpei Yang, Hongyao Tang, Chenjia Bai, Jinyi Liu, Zhaopeng Meng, Peng Liu, Zhen Wang

Improve the Training Efficiency of DRL for Wireless Communication Resource Allocation: The Role of Generative Diffusion Models

Dynamic resource allocation in mobile wireless networks involves complex, time-varying optimization problems, motivating the adoption of deep reinforcement learning (DRL). However, most existing works rely on pre-trained policies,…

Machine Learning · Computer Science 2025-02-12 Xinren Zhang, Jiadong Yu

GRI: General Reinforced Imitation and its Application to Vision-Based Autonomous Driving

Deep reinforcement learning (DRL) has been demonstrated to be effective for several complex decision-making applications such as autonomous driving and robotics. However, DRL is notoriously limited by its high sample complexity and its lack…

Robotics · Computer Science 2022-05-19 Raphael Chekroun, Marin Toromanoff, Sascha Hornauer, Fabien Moutarde

When Multiple Agents Learn to Schedule: A Distributed Radio Resource Management Framework

Interference among concurrent transmissions in a wireless network is a key factor limiting the system performance. One way to alleviate this problem is to manage the radio resources in order to maximize either the average or the worst-case…

Machine Learning · Computer Science 2019-06-24 Navid Naderializadeh, Jaroslaw Sydir, Meryem Simsek, Hosein Nikopour, Shilpa Talwar