Related papers: Solving Compositional Reinforcement Learning Probl…

Reward Learning using Structural Motifs in Inverse Reinforcement Learning

The Inverse Reinforcement Learning (\textit{IRL}) problem has seen rapid evolution in the past few years, with important applications in domains like robotics, cognition, and health. In this work, we explore the inefficacy of current IRL…

Machine Learning · Computer Science 2022-09-28 Raeid Saqur

Robust Subtask Learning for Compositional Generalization

Compositional reinforcement learning is a promising approach for training policies to perform complex long-horizon tasks. Typically, a high-level task is decomposed into a sequence of subtasks and a separate policy is trained to perform…

Machine Learning · Computer Science 2023-06-09 Kishor Jothimurugan , Steve Hsu , Osbert Bastani , Rajeev Alur

Compositional Reinforcement Learning from Logical Specifications

We study the problem of learning control policies for complex tasks given by logical specifications. Recent approaches automatically generate a reward function from a given specification and use a suitable reinforcement learning algorithm…

Machine Learning · Computer Science 2021-12-28 Kishor Jothimurugan , Suguman Bansal , Osbert Bastani , Rajeev Alur

Modular Lifelong Reinforcement Learning via Neural Composition

Humans commonly solve complex problems by decomposing them into easier subproblems and then combining the subproblem solutions. This type of compositional reasoning permits reuse of the subproblem solutions when tackling future tasks that…

Machine Learning · Computer Science 2022-07-04 Jorge A. Mendez , Harm van Seijen , Eric Eaton

Social Interpretable Reinforcement Learning

Reinforcement Learning (RL) bears the promise of being a game-changer in many applications. However, since most of the literature in the field is currently focused on opaque models, the use of RL in high-stakes scenarios, where…

Machine Learning · Computer Science 2025-01-22 Leonardo Lucio Custode , Giovanni Iacca

YES SIR!Optimizing Semantic Space of Negatives with Self-Involvement Ranker

Pre-trained model such as BERT has been proved to be an effective tool for dealing with Information Retrieval (IR) problems. Due to its inspiring performance, it has been widely used to tackle with real-world IR problems such as document…

Information Retrieval · Computer Science 2021-09-15 Ruizhi Pu , Xinyu Zhang , Ruofei Lai , Zikai Guo , Yinxia Zhang , Hao Jiang , Yongkang Wu , Yantao Jia , Zhicheng Dou , Zhao Cao

Compositional Instruction Following with Language Models and Reinforcement Learning

Combining reinforcement learning with language grounding is challenging as the agent needs to explore the environment while simultaneously learning multiple language-conditioned tasks. To address this, we introduce a novel method: the…

Machine Learning · Computer Science 2025-01-23 Vanya Cohen , Geraud Nangue Tasse , Nakul Gopalan , Steven James , Matthew Gombolay , Ray Mooney , Benjamin Rosman

Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations

Model-free deep reinforcement learning (RL) has demonstrated its superiority on many complex sequential decision-making problems. However, heavy dependence on dense rewards and high sample-complexity impedes the wide adoption of these…

Machine Learning · Computer Science 2020-04-02 Zhuangdi Zhu , Kaixiang Lin , Bo Dai , Jiayu Zhou

From Mapping to Composing: A Two-Stage Framework for Zero-shot Composed Image Retrieval

Composed Image Retrieval (CIR) is a challenging multimodal task that retrieves a target image based on a reference image and accompanying modification text. Due to the high cost of annotating CIR triplet datasets, zero-shot (ZS) CIR has…

Computer Vision and Pattern Recognition · Computer Science 2025-04-28 Yabing Wang , Zhuotao Tian , Qingpei Guo , Zheng Qin , Sanping Zhou , Ming Yang , Le Wang

Compositional Image Retrieval via Instruction-Aware Contrastive Learning

Composed Image Retrieval (CIR) involves retrieving a target image based on a composed query of an image paired with text that specifies modifications or changes to the visual reference. CIR is inherently an instruction-following task, as…

Computer Vision and Pattern Recognition · Computer Science 2024-12-10 Wenliang Zhong , Weizhi An , Feng Jiang , Hehuan Ma , Yuzhi Guo , Junzhou Huang

Learning Invariant Representation for Continual Learning

Continual learning aims to provide intelligent agents that are capable of learning continually a sequence of tasks, building on previously learned knowledge. A key challenge in this learning paradigm is catastrophically forgetting…

Machine Learning · Computer Science 2021-01-18 Ghada Sokar , Decebal Constantin Mocanu , Mykola Pechenizkiy

Inverse Reinforcement Learning without Reinforcement Learning

Inverse Reinforcement Learning (IRL) is a powerful set of techniques for imitation learning that aims to learn a reward function that rationalizes expert demonstrations. Unfortunately, traditional IRL methods suffer from a computational…

Machine Learning · Computer Science 2024-01-31 Gokul Swamy , Sanjiban Choudhury , J. Andrew Bagnell , Zhiwei Steven Wu

STIR$^2$: Reward Relabelling for combined Reinforcement and Imitation Learning on sparse-reward tasks

In the search for more sample-efficient reinforcement-learning (RL) algorithms, a promising direction is to leverage as much external off-policy data as possible. For instance, expert demonstrations. In the past, multiple ideas have been…

Machine Learning · Computer Science 2023-03-01 Jesus Bujalance Martin , Fabien Moutarde

Replay Failures as Successes: Sample-Efficient Reinforcement Learning for Instruction Following

Reinforcement Learning (RL) has shown promise for aligning Large Language Models (LLMs) to follow instructions with various constraints. Despite the encouraging results, RL improvement inevitably relies on sampling successful, high-quality…

Artificial Intelligence · Computer Science 2025-12-30 Kongcheng Zhang , Qi Yao , Shunyu Liu , Wenjian Zhang , Min Cen , Yang Zhou , Wenkai Fang , Yiru Zhao , Baisheng Lai , Mingli Song

SIR - an Efficient Solver for Systems of Equations

The Semi-Implicit Root solver (SIR) is an iterative method for globally convergent solution of systems of nonlinear equations. Since publication, SIR has proven robustness for a great variety of problems. We here present MATLAB and MAPLE…

Computational Physics · Physics 2017-04-14 Jan Scheffel , Kristoffer Lindvall

Accelerating Reinforcement Learning by Composing Solutions of Automatically Identified Subtasks

This paper discusses a system that accelerates reinforcement learning by using transfer from related tasks. Without such transfer, even if two tasks are very similar at some abstract level, an extensive re-learning effort is required. The…

Artificial Intelligence · Computer Science 2011-06-10 C. Drummond

SIR: Self-supervised Image Rectification via Seeing the Same Scene from Multiple Different Lenses

Deep learning has demonstrated its power in image rectification by leveraging the representation capacity of deep neural networks via supervised training based on a large-scale synthetic dataset. However, the model may overfit the synthetic…

Computer Vision and Pattern Recognition · Computer Science 2021-06-21 Jinlong Fan , Jing Zhang , Dacheng Tao

RL as Regressor: A Reinforcement Learning Approach for Function Approximation

Standard regression techniques, while powerful, are often constrained by predefined, differentiable loss functions such as mean squared error. These functions may not fully capture the desired behavior of a system, especially when dealing…

Machine Learning · Computer Science 2025-08-04 Yongchao Huang

State of the Art of User Simulation approaches for conversational information retrieval

Conversational Information Retrieval (CIR) is an emerging field of Information Retrieval (IR) at the intersection of interactive IR and dialogue systems for open domain information needs. In order to optimize these interactions and enhance…

Information Retrieval · Computer Science 2022-01-11 Pierre Erbacher , Laure Soulier , Ludovic Denoyer

Curricular Subgoals for Inverse Reinforcement Learning

Inverse Reinforcement Learning (IRL) aims to reconstruct the reward function from expert demonstrations to facilitate policy learning, and has demonstrated its remarkable success in imitation learning. To promote expert-like behavior,…

Machine Learning · Computer Science 2023-06-16 Shunyu Liu , Yunpeng Qing , Shuqi Xu , Hongyan Wu , Jiangtao Zhang , Jingyuan Cong , Tianhao Chen , Yunfu Liu , Mingli Song