Related papers: STAND: Self-Aware Precondition Induction for Inter…

Incremental Reinforcement Learning --- a New Continuous Reinforcement Learning Frame Based on Stochastic Differential Equation methods

Continuous reinforcement learning such as DDPG and A3C are widely used in robot control and autonomous driving. However, both methods have theoretical weaknesses. While DDPG cannot control noises in the control process, A3C does not satisfy…

Machine Learning · Computer Science 2019-08-09 Tianhao Chen , Limei Cheng , Yang Liu , Wenchuan Jia , Shugen Ma

AI Autonomy : Self-Initiated Open-World Continual Learning and Adaptation

As more and more AI agents are used in practice, it is time to think about how to make these agents fully autonomous so that they can (1) learn by themselves continually in a self-motivated and self-initiated manner rather than being…

Artificial Intelligence · Computer Science 2023-04-21 Bing Liu , Sahisnu Mazumder , Eric Robertson , Scott Grigsby

Self-Initiated Open World Learning for Autonomous AI Agents

As more and more AI agents are used in practice, it is time to think about how to make these agents fully autonomous so that they can learn by themselves in a self-motivated and self-supervised manner rather than being retrained…

Artificial Intelligence · Computer Science 2024-03-01 Bing Liu , Eric Robertson , Scott Grigsby , Sahisnu Mazumder

Induction of Subgoal Automata for Reinforcement Learning

In this work we present ISA, a novel approach for learning and exploiting subgoals in reinforcement learning (RL). Our method relies on inducing an automaton whose transitions are subgoals expressed as propositional formulas over a set of…

Machine Learning · Computer Science 2019-12-02 Daniel Furelos-Blanco , Mark Law , Alessandra Russo , Krysia Broda , Anders Jonsson

Human-Interactive Subgoal Supervision for Efficient Inverse Reinforcement Learning

Humans are able to understand and perform complex tasks by strategically structuring the tasks into incremental steps or subgoals. For a robot attempting to learn to perform a sequential task with critical subgoal states, such states can…

Human-Computer Interaction · Computer Science 2018-06-25 Xinlei Pan , Eshed Ohn-Bar , Nicholas Rhinehart , Yan Xu , Yilin Shen , Kris M. Kitani

Making Human-Like Trade-offs in Constrained Environments by Learning from Demonstrations

Many real-life scenarios require humans to make difficult trade-offs: do we always follow all the traffic rules or do we violate the speed limit in an emergency? These scenarios force us to evaluate the trade-off between collective norms…

Artificial Intelligence · Computer Science 2021-09-24 Arie Glazier , Andrea Loreggia , Nicholas Mattei , Taher Rahgooy , Francesca Rossi , K. Brent Venable

Guideline Learning for In-context Information Extraction

Large language models (LLMs) can perform a new task by merely conditioning on task instructions and a few input-output examples, without optimizing any parameters. This is called In-Context Learning (ICL). In-context Information Extraction…

Computation and Language · Computer Science 2025-07-14 Chaoxu Pang , Yixuan Cao , Qiang Ding , Ping Luo

Collaborative Knowledge Infusion for Low-resource Stance Detection

Stance detection is the view towards a specific target by a given context (\textit{e.g.} tweets, commercial reviews). Target-related knowledge is often needed to assist stance detection models in understanding the target well and making…

Computation and Language · Computer Science 2024-03-29 Ming Yan , Joey Tianyi Zhou , Ivor W. Tsang

A Framework for Quantifying How Pre-Training and Context Benefit In-Context Learning

Pre-trained large language models have demonstrated a strong ability to learn from context, known as in-context learning (ICL). Despite a surge of recent applications that leverage such capabilities, it is by no means clear, at least…

Artificial Intelligence · Computer Science 2025-10-28 Bingqing Song , Jiaxiang Li , Rong Wang , Songtao Lu , Mingyi Hong

Self-Improving In-Context Learning

We propose to improve in-context learning (ICL) by optimizing the continuous embeddings of a fixed few-shot prompt at test time. The key observation is that the log-probabilities a model assigns to its demonstrated…

Computation and Language · Computer Science 2026-05-25 Baturay Saglam , Dionysis Kalogerias

Learning Procedural-aware Video Representations through State-Grounded Hierarchy Unfolding

Learning procedural-aware video representations is a key step towards building agents that can reason about and execute complex tasks. Existing methods typically address this problem by aligning visual content with textual descriptions at…

Computer Vision and Pattern Recognition · Computer Science 2025-11-26 Jinghan Zhao , Yifei Huang , Feng Lu

Feature-Based Interpretable Reinforcement Learning based on State-Transition Models

Growing concerns regarding the operational usage of AI models in the real-world has caused a surge of interest in explaining AI models' decisions to humans. Reinforcement Learning is not an exception in this regard. In this work, we propose…

Machine Learning · Computer Science 2023-10-06 Omid Davoodi , Majid Komeili

Learning states enhanced knowledge tracing: Simulating the diversity in real-world learning process

The Knowledge Tracing (KT) task focuses on predicting a learner's future performance based on the historical interactions. The knowledge state plays a key role in learning process. However, considering that the knowledge state is influenced…

Artificial Intelligence · Computer Science 2024-12-30 Shanshan Wang , Xueying Zhang , Keyang Wang , Xun Yang , Xingyi Zhang

Learner-aware Teaching: Inverse Reinforcement Learning with Preferences and Constraints

Inverse reinforcement learning (IRL) enables an agent to learn complex behavior by observing demonstrations from a (near-)optimal policy. The typical assumption is that the learner's goal is to match the teacher's demonstrated behavior. In…

Machine Learning · Computer Science 2019-10-30 Sebastian Tschiatschek , Ahana Ghosh , Luis Haug , Rati Devidze , Adish Singla

HyperPrompt: Prompt-based Task-Conditioning of Transformers

Prompt-Tuning is a new paradigm for finetuning pre-trained language models in a parameter-efficient way. Here, we explore the use of HyperNetworks to generate hyper-prompts: we propose HyperPrompt, a novel architecture for prompt-based…

Computation and Language · Computer Science 2022-06-16 Yun He , Huaixiu Steven Zheng , Yi Tay , Jai Gupta , Yu Du , Vamsi Aribandi , Zhe Zhao , YaGuang Li , Zhao Chen , Donald Metzler , Heng-Tze Cheng , Ed H. Chi

Grounding Hindsight Instructions in Multi-Goal Reinforcement Learning for Robotics

This paper focuses on robotic reinforcement learning with sparse rewards for natural language goal representations. An open problem is the sample-inefficiency that stems from the compositionality of natural language, and from the grounding…

Machine Learning · Computer Science 2022-09-12 Frank Röder , Manfred Eppe , Stefan Wermter

Learning When to Act: Interval-Aware Reinforcement Learning with Predictive Temporal Structure

Autonomous agents operating in continuous environments must decide not only what to do, but when to act. We introduce a lightweight adaptive temporal control system that learns the optimal interval between cognitive ticks from experience,…

Machine Learning · Computer Science 2026-03-27 Davide Di Gioia

Positional Knowledge is All You Need: Position-induced Transformer (PiT) for Operator Learning

Operator learning for Partial Differential Equations (PDEs) is rapidly emerging as a promising approach for surrogate modeling of intricate systems. Transformers with the self-attention mechanism$\unicode{x2013}$a powerful tool originally…

Machine Learning · Computer Science 2024-05-17 Junfeng Chen , Kailiang Wu

InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning

Outcome-reward reinforcement learning (RL) has proven effective at improving the reasoning capabilities of large language models (LLMs). However, standard RL assigns credit only at the level of the final answer, penalizing entire reasoning…

Machine Learning · Computer Science 2026-01-21 Matthew Y. R. Yang , Hao Bai , Ian Wu , Gene Yang , Amrith Setlur , Aviral Kumar

Decision Making for Autonomous Driving via Augmented Adversarial Inverse Reinforcement Learning

Making decisions in complex driving environments is a challenging task for autonomous agents. Imitation learning methods have great potentials for achieving such a goal. Adversarial Inverse Reinforcement Learning (AIRL) is one of the…

Artificial Intelligence · Computer Science 2021-03-29 Pin Wang , Dapeng Liu , Jiayu Chen , Hanhan Li , Ching-Yao Chan