Related papers: Data Valuation using Reinforcement Learning

Normality-Guided Distributional Reinforcement Learning for Continuous Control

Learning a predictive model of the mean return, or value function, plays a critical role in many reinforcement learning algorithms. Distributional reinforcement learning (DRL) has been shown to improve performance by modeling the value…

Machine Learning · Computer Science 2025-07-08 Ju-Seung Byun , Andrew Perrault

A Survey on Deep Reinforcement Learning for Data Processing and Analytics

Data processing and analytics are fundamental and pervasive. Algorithms play a vital role in data processing and analytics where many algorithm designs have incorporated heuristics and general rules from human knowledge and experience to…

Machine Learning · Computer Science 2022-02-07 Qingpeng Cai , Can Cui , Yiyuan Xiong , Wei Wang , Zhongle Xie , Meihui Zhang

Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization

Reinforcement learning (RL) is a powerful machine learning technique that enables an intelligent agent to learn an optimal policy that maximizes the cumulative rewards in sequential decision making. Most of methods in the existing…

Machine Learning · Statistics 2023-01-06 Chengchun Shi , Zhengling Qi , Jianing Wang , Fan Zhou

How Should We Meta-Learn Reinforcement Learning Algorithms?

The process of meta-learning algorithms from data, instead of relying on manual design, is growing in popularity as a paradigm for improving the performance of machine learning systems. Meta-learning shows particular promise for…

Machine Learning · Computer Science 2025-09-11 Alexander David Goldie , Zilin Wang , Jaron Cohen , Jakob Nicolaus Foerster , Shimon Whiteson

Optimizing Data Usage via Differentiable Rewards

To acquire a new skill, humans learn better and faster if a tutor, based on their current knowledge level, informs them of how much attention they should pay to particular content or practice problems. Similarly, a machine learning model…

Machine Learning · Computer Science 2021-06-18 Xinyi Wang , Hieu Pham , Paul Michel , Antonios Anastasopoulos , Jaime Carbonell , Graham Neubig

Value Driven Representation for Human-in-the-Loop Reinforcement Learning

Interactive adaptive systems powered by Reinforcement Learning (RL) have many potential applications, such as intelligent tutoring systems. In such systems there is typically an external human system designer that is creating, monitoring…

Artificial Intelligence · Computer Science 2020-04-06 Ramtin Keramati , Emma Brunskill

VRL3: A Data-Driven Framework for Visual Deep Reinforcement Learning

We propose VRL3, a powerful data-driven framework with a simple design for solving challenging visual deep reinforcement learning (DRL) tasks. We analyze a number of major obstacles in taking a data-driven approach, and present a suite of…

Computer Vision and Pattern Recognition · Computer Science 2023-04-03 Che Wang , Xufang Luo , Keith Ross , Dongsheng Li

A Survey on Explainable Deep Reinforcement Learning

Deep Reinforcement Learning (DRL) has achieved remarkable success in sequential decision-making tasks across diverse domains, yet its reliance on black-box neural architectures hinders interpretability, trust, and deployment in high-stakes…

Machine Learning · Computer Science 2025-02-12 Zelei Cheng , Jiahao Yu , Xinyu Xing

Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning?

In deep Reinforcement Learning (RL), value functions are typically approximated using deep neural networks and trained via mean squared error regression objectives to fit the true value functions. Recent research has proposed an alternative…

Machine Learning · Computer Science 2024-11-19 Denis Tarasov , Kirill Brilliantov , Dmitrii Kharlapenko

Data Valuation for Offline Reinforcement Learning

The success of deep reinforcement learning (DRL) hinges on the availability of training data, which is typically obtained via a large number of environment interactions. In many real-world scenarios, costs and risks are associated with…

Machine Learning · Computer Science 2022-05-20 Amir Abolfazli , Gregory Palmer , Daniel Kudenko

Causal Deep Reinforcement Learning Using Observational Data

Deep reinforcement learning (DRL) requires the collection of interventional data, which is sometimes expensive and even unethical in the real world, such as in the autonomous driving and the medical field. Offline reinforcement learning…

Machine Learning · Computer Science 2023-06-12 Wenxuan Zhu , Chao Yu , Qiang Zhang

Quality-Aware Multimodal Saliency Detection via Deep Reinforcement Learning

Incorporating various modes of information into the machine learning procedure is becoming a new trend. And data from various source can provide more information than single one no matter they are heterogeneous or homogeneous. Existing deep…

Computer Vision and Pattern Recognition · Computer Science 2018-11-28 Xiao Wang , Tao Sun , Rui Yang , Chenglong Li , Bin Luo , Jin Tang

A Comprehensive Survey of Reinforcement Learning: From Algorithms to Practical Challenges

Reinforcement Learning (RL) has emerged as a powerful paradigm in Artificial Intelligence (AI), enabling agents to learn optimal behaviors through interactions with their environments. Drawing from the foundations of trial and error, RL…

Artificial Intelligence · Computer Science 2025-02-04 Majid Ghasemi , Amir Hossein Moosavi , Dariush Ebrahimi

Dynamic Learning Rate for Deep Reinforcement Learning: A Bandit Approach

In deep Reinforcement Learning (RL), the learning rate critically influences both stability and performance, yet its optimal value shifts during training as the environment and policy evolve. Standard decay schedulers assume monotonic…

Machine Learning · Computer Science 2025-10-09 Henrique Donâncio , Antoine Barrier , Leah F. South , Florence Forbes

LearnAlign: Data Selection for LLM Reinforcement Learning with Improved Gradient Alignment

Reinforcement learning with verifiable rewards (RLVR) has become a key technique for enhancing LLMs' reasoning abilities, yet its data inefficiency remains a major bottleneck. To address this critical yet challenging issue, we present a…

Machine Learning · Computer Science 2026-04-28 Shipeng Li , Zhiqin Yang , Shikun Li , Xiaobo Xia , Hengyu Liu , Xinghua Zhang , Gaode Chen , Dong Fang , Ying Tai , Zhe Peng

Evolutionary Reinforcement Learning: A Survey

Reinforcement learning (RL) is a machine learning approach that trains agents to maximize cumulative rewards through interactions with environments. The integration of RL with deep learning has recently resulted in impressive achievements…

Neural and Evolutionary Computing · Computer Science 2023-08-31 Hui Bai , Ran Cheng , Yaochu Jin

A Transferable and Automatic Tuning of Deep Reinforcement Learning for Cost Effective Phishing Detection

Many challenging real-world problems require the deployment of ensembles multiple complementary learning models to reach acceptable performance levels. While effective, applying the entire ensemble to every sample is costly and often…

Cryptography and Security · Computer Science 2022-09-20 Orel Lavie , Asaf Shabtai , Gilad Katz

Meta-Learning-Based Deep Reinforcement Learning for Multiobjective Optimization Problems

Deep reinforcement learning (DRL) has recently shown its success in tackling complex combinatorial optimization problems. When these problems are extended to multiobjective ones, it becomes difficult for the existing DRL approaches to…

Artificial Intelligence · Computer Science 2022-02-15 Zizhen Zhang , Zhiyuan Wu , Hang Zhang , Jiahai Wang

Train Hard, Fight Easy: Robust Meta Reinforcement Learning

A major challenge of reinforcement learning (RL) in real-world applications is the variation between environments, tasks or clients. Meta-RL (MRL) addresses this issue by learning a meta-policy that adapts to new tasks. Standard MRL methods…

Machine Learning · Computer Science 2023-10-03 Ido Greenberg , Shie Mannor , Gal Chechik , Eli Meirom

Deep Reinforcement Learning with Weighted Q-Learning

Reinforcement learning algorithms based on Q-learning are driving Deep Reinforcement Learning (DRL) research towards solving complex problems and achieving super-human performance on many of them. Nevertheless, Q-Learning is known to be…

Machine Learning · Computer Science 2022-06-14 Andrea Cini , Carlo D'Eramo , Jan Peters , Cesare Alippi