Related papers: Automating Predictive Modeling Process using Reinf…

Automated Reinforcement Learning: An Overview

Reinforcement Learning and, recently, Deep Reinforcement Learning are popular methods for solving sequential decision-making problems modeled as Markov Decision Processes. RL modeling of a problem and selecting algorithms and…

Machine Learning · Computer Science 2026-03-10 Reza Refaei Afshar, Joaquin Vanschoren, Uzay Kaymak, Rui Zhang, Yaoxin Wu, Wen Song, Yingqian Zhang

Active Reinforcement Learning over MDPs

The past decade has seen the rapid development of Reinforcement Learning, which achieves impressive performance with numerous training resources. However, one of the greatest challenges in RL is generalization efficiency (i.e.,…

Machine Learning · Computer Science 2021-08-18 Qi Yang, Peng Yang, Ke Tang

Rethinking Reinforcement Learning for Recommendation: A Prompt Perspective

Modern recommender systems aim to improve user experience. As reinforcement learning (RL) naturally fits this objective -- maximizing a user's reward per session -- it has become an emerging topic in recommender systems. Developing…

Information Retrieval · Computer Science 2022-06-16 Xin Xin, Tiago Pimentel, Alexandros Karatzoglou, Pengjie Ren, Konstantina Christakopoulou, Zhaochun Ren

Sample Efficient Active Algorithms for Offline Reinforcement Learning

Offline reinforcement learning (RL) enables policy learning from static data but often suffers from poor coverage of the state-action space and distributional shift problems. This problem can be addressed by allowing limited online…

Machine Learning · Computer Science 2026-02-03 Soumyadeep Roy, Shashwat Kushwaha, Ambedkar Dukkipati

Advances in Preference-based Reinforcement Learning: A Review

Reinforcement Learning (RL) algorithms suffer from the dependency on accurately engineered reward functions to properly guide the learning agents to do the required tasks. Preference-based reinforcement learning (PbRL) addresses that by…

Artificial Intelligence · Computer Science 2024-08-23 Youssef Abdelkareem, Shady Shehata, Fakhri Karray

Reinforcement Learning based dynamic weighing of Ensemble Models for Time Series Forecasting

Ensemble models are powerful model building tools that are developed with a focus to improve the accuracy of model predictions. They find applications in time series forecasting in varied scenarios including but not limited to process…

Machine Learning · Computer Science 2020-08-21 Satheesh K. Perepu, Bala Shyamala Balaji, Hemanth Kumar Tanneru, Sudhakar Kathari, Vivek Shankar Pinnamaraju

Prediction Based Decision Making for Autonomous Highway Driving

Autonomous driving decision-making is a challenging task due to the inherent complexity and uncertainty in traffic. For example, adjacent vehicles may change their lane or overtake at any time to pass a slow vehicle or to help traffic flow.…

Robotics · Computer Science 2022-09-07 Mustafa Yildirim, Sajjad Mozaffari, Luc McCutcheon, Mehrdad Dianati, Alireza Tamaddoni-Nezhad, Saber Fallah

Reinforcement Learning for Combinatorial Optimization: A Survey

Many traditional algorithms for solving combinatorial optimization problems involve using hand-crafted heuristics that sequentially construct a solution. Such heuristics are designed by domain experts and may often be suboptimal due to the…

Machine Learning · Computer Science 2020-12-25 Nina Mazyavkina, Sergey Sviridov, Sergei Ivanov, Evgeny Burnaev

Augmented Reinforcement Learning Framework For Enhancing Decision-Making In Machine Learning Models Using External Agents

This work proposes a novel technique, the Augmented Reinforcement Learning framework, for improving the decision-making capabilities of machine learning models. Agents introduced as external overseers check model decisions. The…

Machine Learning · Computer Science 2025-08-05 Sandesh Kumar Singh

APRIL: Active Preference-learning based Reinforcement Learning

This paper focuses on reinforcement learning (RL) with limited prior knowledge. In the domain of swarm robotics for instance, the expert can hardly design a reward function or demonstrate the target behavior, forbidding the use of both…

Machine Learning · Computer Science 2012-08-07 Riad Akrour, Marc Schoenauer, Michèle Sebag

Maximum Likelihood Reinforcement Learning

Reinforcement learning is the method of choice to train models in sampling-based setups with binary outcome feedback, such as navigation, code generation, and mathematical problem solving. In such settings, models implicitly induce a…

Machine Learning · Computer Science 2026-02-04 Fahim Tajwar, Guanning Zeng, Yueer Zhou, Yuda Song, Daman Arora, Yiding Jiang, Jeff Schneider, Ruslan Salakhutdinov, Haiwen Feng, Andrea Zanette

APPLR: Adaptive Planner Parameter Learning from Reinforcement

Classical navigation systems typically operate using a fixed set of hand-picked parameters (e.g. maximum speed, sampling rate, inflation radius, etc.) and require heavy expert re-tuning in order to work in new environments. To mitigate this…

Robotics · Computer Science 2020-11-03 Zifan Xu, Gauraang Dhamankar, Anirudh Nair, Xuesu Xiao, Garrett Warnell, Bo Liu, Zizhao Wang, Peter Stone

Searching for Plannable Domains can Speed up Reinforcement Learning

Reinforcement learning (RL) involves sequential decision making in uncertain environments. The aim of the decision-making agent is to maximize the benefit of acting in its environment over an extended period of time. Finding an optimal…

Artificial Intelligence · Computer Science 2007-05-23 Istvan Szita, Balint Takacs, Andras Lorincz

A Survey on Model-based Reinforcement Learning

Reinforcement learning (RL) solves sequential decision-making problems via a trial-and-error process interacting with the environment. While RL achieves outstanding success in playing complex video games that allow huge trial-and-error,…

Machine Learning · Computer Science 2022-06-22 Fan-Ming Luo, Tian Xu, Hang Lai, Xiong-Hui Chen, Weinan Zhang, Yang Yu

Provable Reward-Agnostic Preference-Based Reinforcement Learning

Preference-based Reinforcement Learning (PbRL) is a paradigm in which an RL agent learns to optimize a task using pair-wise preference-based feedback over trajectories, rather than explicit reward signals. While PbRL has demonstrated…

Machine Learning · Computer Science 2024-04-18 Wenhao Zhan, Masatoshi Uehara, Wen Sun, Jason D. Lee

A Comprehensive Survey of Reinforcement Learning: From Algorithms to Practical Challenges

Reinforcement Learning (RL) has emerged as a powerful paradigm in Artificial Intelligence (AI), enabling agents to learn optimal behaviors through interactions with their environments. Drawing from the foundations of trial and error, RL…

Artificial Intelligence · Computer Science 2025-02-04 Majid Ghasemi, Amir Hossein Moosavi, Dariush Ebrahimi

Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models

Reinforcement Learning (RL) has shown remarkable success in enhancing the reasoning capabilities of Large Language Models (LLMs). Process-Supervised RL (PSRL) has emerged as a more effective paradigm compared to outcome-based RL. However,…

Machine Learning · Computer Science 2025-10-01 Runze Liu, Jiakang Wang, Yuling Shi, Zhihui Xie, Chenxin An, Kaiyan Zhang, Jian Zhao, Xiaodong Gu, Lei Lin, Wenping Hu, Xiu Li, Fuzheng Zhang, Guorui Zhou, Kun Gai

Adaptive Reinforcement Learning for Dynamic Configuration Allocation in Pre-Production Testing

Ensuring reliability in modern software systems requires rigorous pre-production testing across highly heterogeneous and evolving environments. Because exhaustive evaluation is infeasible, practitioners must decide how to allocate limited…

Software Engineering · Computer Science 2025-10-08 Yu Zhu

Discriminator Augmented Model-Based Reinforcement Learning

By planning through a learned dynamics model, model-based reinforcement learning (MBRL) offers the prospect of good performance with little environment interaction. However, it is common in practice for the learned model to be inaccurate,…

Machine Learning · Computer Science 2021-03-31 Behzad Haghgoo, Allan Zhou, Archit Sharma, Chelsea Finn

Direct Preference-based Policy Optimization without Reward Modeling

Preference-based reinforcement learning (PbRL) is an approach that enables RL agents to learn from preference, which is particularly useful when formulating a reward function is challenging. Existing PbRL methods generally involve a…

Machine Learning · Computer Science 2023-10-30 Gaon An, Junhyeok Lee, Xingdong Zuo, Norio Kosaka, Kyung-Min Kim, Hyun Oh Song