Related papers: Instance-Dependent Continuous-Time Reinforcement L…

Sample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function Approximation

Continuous-time reinforcement learning (CTRL) provides a principled framework for sequential decision-making in environments where interactions evolve continuously over time. Despite its empirical success, the theoretical understanding of…

Machine Learning · Computer Science 2025-05-22 Runze Zhao , Yue Yu , Adams Yiyue Zhu , Chen Yang , Dongruo Zhou

Probabilistic Satisfaction of Temporal Logic Constraints in Reinforcement Learning via Adaptive Policy-Switching

Constrained Reinforcement Learning (CRL) is a subset of machine learning that introduces constraints into the traditional reinforcement learning (RL) framework. Unlike conventional RL which aims solely to maximize cumulative rewards, CRL…

Artificial Intelligence · Computer Science 2024-12-02 Xiaoshan Lin , Sadık Bera Yüksel , Yasin Yazıcıoğlu , Derya Aksaray

Instance-Dependent Confidence and Early Stopping for Reinforcement Learning

Various algorithms for reinforcement learning (RL) exhibit dramatic variation in their convergence rates as a function of problem structure. Such problem-dependent behavior is not captured by worst-case analyses and has accordingly inspired…

Machine Learning · Statistics 2022-01-24 Koulik Khamaru , Eric Xia , Martin J. Wainwright , Michael I. Jordan

Sample-efficient and Scalable Exploration in Continuous-Time RL

Reinforcement learning algorithms are typically designed for discrete-time dynamics, even though the underlying real-world control systems are often continuous in time. In this paper, we study the problem of continuous-time reinforcement…

Machine Learning · Computer Science 2026-03-03 Klemens Iten , Lenart Treven , Bhavya Sukhija , Florian Dörfler , Andreas Krause

Learning Temporal Point Processes via Reinforcement Learning

Social goods, such as healthcare, smart city, and information networks, often produce ordered event data in continuous time. The generative processes of these event data can be very complex, requiring flexible models to capture their…

Machine Learning · Computer Science 2020-12-29 Shuang Li , Shuai Xiao , Shixiang Zhu , Nan Du , Yao Xie , Le Song

Density Constrained Reinforcement Learning

We study constrained reinforcement learning (CRL) from a novel perspective by setting constraints directly on state density functions, rather than the value functions considered by previous works. State density has a clear physical and…

Machine Learning · Computer Science 2021-06-25 Zengyi Qin , Yuxiao Chen , Chuchu Fan

Continuous-Time Model-Based Reinforcement Learning

Model-based reinforcement learning (MBRL) approaches rely on discrete-time state transition models whereas physical systems and the vast majority of control tasks operate in continuous-time. To avoid time-discretization approximation of the…

Machine Learning · Computer Science 2021-06-14 Çağatay Yıldız , Markus Heinonen , Harri Lähdesmäki

CTRL-RAG: Contrastive Likelihood Reward Based Reinforcement Learning for Context-Faithful RAG Models

With the growing use of Retrieval-Augmented Generation (RAG), training large language models (LLMs) for context-sensitive reasoning and faithfulness is increasingly important. Existing RAG-oriented reinforcement learning (RL) methods rely…

Computation and Language · Computer Science 2026-03-06 Zhehao Tan , Yihan Jiao , Dan Yang , Junjie Wang , Duolin Sun , Jie Feng , Xidong Wang , Lei Liu , Yue Shen , Jian Wang , Jinjie Gu

TCRL: Temporal-Coupled Adversarial Training for Robust Constrained Reinforcement Learning in Worst-Case Scenarios

Constrained Reinforcement Learning (CRL) aims to optimize decision-making policies under constraint conditions, making it highly applicable to safety-critical domains such as autonomous driving, robotics, and power grid management. However,…

Machine Learning · Computer Science 2026-02-16 Wentao Xu , Zhongming Yao , Weihao Li , Zhenghang Song , Yumeng Song , Tianyi Li , Yushuai Li

Learning in Context, Guided by Choice: A Reward-Free Paradigm for Reinforcement Learning with Transformers

In-context reinforcement learning (ICRL) leverages the in-context learning capabilities of transformer models (TMs) to efficiently generalize to unseen sequential decision-making tasks without parameter updates. However, existing ICRL…

Machine Learning · Computer Science 2026-02-10 Juncheng Dong , Bowen He , Moyang Guo , Ethan X. Fang , Zhuoran Yang , Vahid Tarokh

A Survey on Model-based Reinforcement Learning

Reinforcement learning (RL) solves sequential decision-making problems via a trial-and-error process interacting with the environment. While RL achieves outstanding success in playing complex video games that allow huge trial-and-error,…

Machine Learning · Computer Science 2022-06-22 Fan-Ming Luo , Tian Xu , Hang Lai , Xiong-Hui Chen , Weinan Zhang , Yang Yu

Continuous-Time Reinforcement Learning: New Design Algorithms with Theoretical Insights and Performance Guarantees

Continuous-time nonlinear optimal control problems hold great promise in real-world applications. After decades of development, reinforcement learning (RL) has achieved some of the greatest successes as a general nonlinear control design…

Systems and Control · Electrical Eng. & Systems 2023-07-19 Brent A. Wallace , Jennie Si

Lifelong Incremental Reinforcement Learning with Online Bayesian Inference

A central capability of a long-lived reinforcement learning (RL) agent is to incrementally adapt its behavior as its environment changes, and to incrementally build upon previous experiences to facilitate future learning in real-world…

Machine Learning · Computer Science 2021-02-15 Zhi Wang , Chunlin Chen , Daoyi Dong

Towards Large-Scale In-Context Reinforcement Learning by Meta-Training in Randomized Worlds

In-Context Reinforcement Learning (ICRL) enables agents to learn automatically and on-the-fly from their interactive experiences. However, a major challenge in scaling up ICRL is the lack of scalable task collections. To address this, we…

Machine Learning · Computer Science 2025-11-04 Fan Wang , Pengtao Shao , Yiming Zhang , Bo Yu , Shaoshan Liu , Ning Ding , Yang Cao , Yu Kang , Haifeng Wang

Beyond No Regret: Instance-Dependent PAC Reinforcement Learning

The theory of reinforcement learning has focused on two fundamental problems: achieving low regret, and identifying $\epsilon$-optimal policies. While a simple reduction allows one to apply a low-regret algorithm to obtain an…

Machine Learning · Computer Science 2022-06-23 Andrew Wagenmaker , Max Simchowitz , Kevin Jamieson

TR-ICRL: Test-Time Rethinking for In-Context Reinforcement Learning

In-Context Reinforcement Learning (ICRL) enables Large Language Models (LLMs) to learn online from external rewards directly within the context window. However, a central challenge in ICRL is reward estimation, as models typically lack…

Computation and Language · Computer Science 2026-04-02 Wenxuan Jiang , Yuxin Zuo , Zijian Zhang , Xuecheng Wu , Zining Fan , Wenxuan Liu , Li Chen , Xiaoyu Li , Xuezhi Cao , Xiaolong Jin , Ninghao Liu

TW-CRL: Time-Weighted Contrastive Reward Learning for Efficient Inverse Reinforcement Learning

Episodic tasks in Reinforcement Learning (RL) often pose challenges due to sparse reward signals and high-dimensional state spaces, which hinder efficient learning. Additionally, these tasks often feature hidden "trap states" --…

Machine Learning · Computer Science 2025-05-23 Yuxuan Li , Yicheng Gao , Ning Yang , Stephen Xia

Simplified Temporal Consistency Reinforcement Learning

Reinforcement learning is able to solve complex sequential decision-making tasks but is currently limited by sample efficiency and required computation. To improve sample efficiency, recent work focuses on model-based RL which interleaves…

Machine Learning · Computer Science 2023-06-19 Yi Zhao , Wenshuai Zhao , Rinu Boney , Juho Kannala , Joni Pajarinen

Towards Monotonic Improvement in In-Context Reinforcement Learning

In-Context Reinforcement Learning (ICRL) has emerged as a promising paradigm for developing agents that can rapidly adapt to new tasks by leveraging past experiences as context, without updating their parameters. Recent approaches train…

Machine Learning · Computer Science 2025-09-30 Wenhao Zhang , Shao Zhang , Xihuai Wang , Yang Li , Ying Wen

TTR-Based Reward for Reinforcement Learning with Implicit Model Priors

Model-free reinforcement learning (RL) is a powerful approach for learning control policies directly from high-dimensional state and observation. However, it tends to be data-inefficient, which is especially costly in robotic learning…

Robotics · Computer Science 2020-10-14 Xubo Lyu , Mo Chen