Related papers: Experiential Reinforcement Learning

Experiential Reflective Learning for Self-Improving LLM Agents

Recent advances in large language models (LLMs) have enabled the development of autonomous agents capable of complex reasoning and multi-step problem solving. However, these agents struggle to adapt to specialized environments and do not…

Machine Learning · Computer Science 2026-04-02 Marc-Antoine Allard, Arnaud Teinturier, Victor Xing, Gautier Viaud

Autonomous Reinforcement Learning: Formalism and Benchmarking

Reinforcement learning (RL) provides a naturalistic framing for learning through trial and error, which is appealing both because of its simplicity and effectiveness and because of its resemblance to how humans and animals acquire skills…

Machine Learning · Computer Science 2022-08-09 Archit Sharma, Kelvin Xu, Nikhil Sardana, Abhishek Gupta, Karol Hausman, Sergey Levine, Chelsea Finn

Evolutionary Reinforcement Learning: A Survey

Reinforcement learning (RL) is a machine learning approach that trains agents to maximize cumulative rewards through interactions with environments. The integration of RL with deep learning has recently resulted in impressive achievements…

Neural and Evolutionary Computing · Computer Science 2023-08-31 Hui Bai, Ran Cheng, Yaochu Jin

A Survey On Enhancing Reinforcement Learning in Complex Environments: Insights from Human and LLM Feedback

Reinforcement learning (RL) is one of the active fields in machine learning, demonstrating remarkable potential in tackling real-world challenges. Despite its promising prospects, this methodology has encountered issues and challenges,…

Machine Learning · Computer Science 2024-11-21 Alireza Rashidi Laleh, Majid Nili Ahmadabadi

Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models

Designing effective reward functions remains a fundamental challenge in reinforcement learning (RL), as it often requires extensive human effort and domain expertise. While RL from human feedback has been successful in aligning agents with…

Machine Learning · Computer Science 2025-06-17 Tung Minh Luu, Younghwan Lee, Donghoon Lee, Sunho Kim, Min Jun Kim, Chang D. Yoo

Experiential Explanations for Reinforcement Learning

Reinforcement learning (RL) systems can be complex and non-interpretable, making it challenging for non-AI experts to understand or intervene in their decisions. This is due in part to the sequential nature of RL in which actions are chosen…

Artificial Intelligence · Computer Science 2025-04-16 Amal Alabdulkarim, Madhuri Singh, Gennie Mansi, Kaely Hall, Upol Ehsan, Mark O. Riedl

ESRL: Efficient Sampling-based Reinforcement Learning for Sequence Generation

Applying Reinforcement Learning (RL) to sequence generation models enables the direct optimization of long-term rewards (e.g., BLEU and human feedback), but typically requires large-scale sampling over a space of action sequences.…

Computation and Language · Computer Science 2023-08-07 Chenglong Wang, Hang Zhou, Yimin Hu, Yifu Huo, Bei Li, Tongran Liu, Tong Xiao, Jingbo Zhu

Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs

Reinforcement Learning (RL) with rubric-based rewards has recently shown remarkable progress in enhancing general reasoning capabilities of Large Language Models (LLMs), yet still suffers from ineffective exploration confined to current…

Artificial Intelligence · Computer Science 2026-03-23 Wenjian Zhang, Kongcheng Zhang, Jiaxin Qi, Baisheng Lai, Jianqiang Huang

Reflexion: Language Agents with Verbal Reinforcement Learning

Large language models (LLMs) have been increasingly used to interact with external environments (e.g., games, compilers, APIs) as goal-driven agents. However, it remains challenging for these language agents to quickly and efficiently learn…

Artificial Intelligence · Computer Science 2023-10-11 Noah Shinn, Federico Cassano, Edward Berman, Ashwin Gopinath, Karthik Narasimhan, Shunyu Yao

ELO-Rated Sequence Rewards: Advancing Reinforcement Learning Models

Reinforcement Learning (RL) heavily relies on the careful design of the reward function. However, accurately assigning rewards to each state-action pair in Long-Term Reinforcement Learning (LTRL) tasks remains a significant challenge. As a…

Machine Learning · Computer Science 2025-06-03 Qi Ju, Falin Hei, Zhemei Fang, Yunfeng Luo

Ensemble Reinforcement Learning: A Survey

Reinforcement Learning (RL) has emerged as a highly effective technique for addressing various scientific and applied problems. Despite its success, certain complex tasks remain challenging to be addressed solely with a single model and…

Machine Learning · Computer Science 2023-12-14 Yanjie Song, P. N. Suganthan, Witold Pedrycz, Junwei Ou, Yongming He, Yingwu Chen, Yutong Wu

A Comprehensive Survey of Reinforcement Learning: From Algorithms to Practical Challenges

Reinforcement Learning (RL) has emerged as a powerful paradigm in Artificial Intelligence (AI), enabling agents to learn optimal behaviors through interactions with their environments. Drawing from the foundations of trial and error, RL…

Artificial Intelligence · Computer Science 2025-02-04 Majid Ghasemi, Amir Hossein Moosavi, Dariush Ebrahimi

oIRL: Robust Adversarial Inverse Reinforcement Learning with Temporally Extended Actions

Explicit engineering of reward functions for given environments has been a major hindrance to reinforcement learning methods. While Inverse Reinforcement Learning (IRL) is a solution to recover reward functions from demonstrations only,…

Machine Learning · Computer Science 2020-02-24 David Venuto, Jhelum Chakravorty, Leonard Boussioux, Junhao Wang, Gavin McCracken, Doina Precup

EMO-RL: Emotion-Rule-Based Reinforcement Learning Enhanced Audio-Language Model for Generalized Speech Emotion Recognition

Although Large Audio-Language Models (LALMs) have exhibited outstanding performance in auditory understanding, their performance in affective computing scenarios, particularly in emotion recognition, reasoning, and subtle sentiment…

Sound · Computer Science 2025-09-23 Pengcheng Li, Botao Zhao, Zuheng Kang, Junqing Peng, Xiaoyang Qu, Yayun He, Jianzong Wang

Improving RL Exploration for LLM Reasoning through Retrospective Replay

Reinforcement learning (RL) has increasingly become a pivotal technique in the post-training of large language models (LLMs). The effective exploration of the output space is essential for the success of RL. We observe that for complex…

Machine Learning · Computer Science 2025-07-08 Shihan Dou, Muling Wu, Jingwen Xu, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang

Natural Language Reinforcement Learning

Artificial intelligence progresses towards the "Era of Experience," where agents are expected to learn from continuous, grounded interaction. We argue that traditional Reinforcement Learning (RL), which typically represents value as a…

Machine Learning · Computer Science 2025-05-29 Xidong Feng, Bo Liu, Yan Song, Haotian Fu, Ziyu Wan, Girish A. Koushik, Zhiyuan Hu, Mengyue Yang, Ying Wen, Jun Wang

A Survey on Model-based Reinforcement Learning

Reinforcement learning (RL) solves sequential decision-making problems via a trial-and-error process interacting with the environment. While RL achieves outstanding success in playing complex video games that allow huge trial-and-error,…

Machine Learning · Computer Science 2022-06-22 Fan-Ming Luo, Tian Xu, Hang Lai, Xiong-Hui Chen, Weinan Zhang, Yang Yu

Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments

Meta reinforcement learning (Meta-RL) is an approach wherein the experience gained from solving a variety of tasks is distilled into a meta-policy. The meta-policy, when adapted over only a small (or just a single) number of steps, is able…

Machine Learning · Computer Science 2022-09-28 Desik Rengarajan, Sapana Chaudhary, Jaewon Kim, Dileep Kalathil, Srinivas Shakkottai

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

We explore a method for improving the performance of large language models through self-reflection and reinforcement learning. By incentivizing the model to generate better self-reflections when it answers incorrectly, we demonstrate that a…

Computation and Language · Computer Science 2025-06-02 Shelly Bensal, Umar Jamil, Christopher Bryant, Melisa Russak, Kiran Kamble, Dmytro Mozolevskyi, Muayad Ali, Waseem AlShikh

Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation

Reinforcement learning (RL) can align language models with non-differentiable reward signals, such as human preferences. However, a major challenge arises from the sparsity of these reward signals: typically, there is only a single reward…

Computation and Language · Computer Science 2024-02-20 Meng Cao, Lei Shu, Lei Yu, Yun Zhu, Nevan Wichers, Yinxiao Liu, Lei Meng