Related papers: STAND: Self-Aware Precondition Induction for Inter…

Autonomous Open-Ended Learning of Tasks with Non-Stationary Interdependencies

Autonomous open-ended learning is a relevant approach in machine learning and robotics, allowing the design of artificial agents able to acquire goals and motor skills without the necessity of user assigned tasks. A crucial issue for this…

Machine Learning · Computer Science 2022-05-17 Alejandro Romero, Gianluca Baldassarre, Richard J. Duro, Vieri Giuliano Santucci

Human AI interaction loop training: New approach for interactive reinforcement learning

Reinforcement Learning (RL) in various decision-making tasks of machine learning provides effective results with an agent learning from a stand-alone reward function. However, it presents unique challenges with large amounts of environment…

Machine Learning · Computer Science 2020-03-10 Neda Navidi

Self-Adaptive In-Context Learning: An Information Compression Perspective for In-Context Example Selection and Ordering

Despite the surprising few-shot performance of in-context learning (ICL), it is still a common practice to randomly sample examples to serve as context. This paper advocates a new principle for ICL: self-adaptive in-context learning. The…

Computation and Language · Computer Science 2023-05-04 Zhiyong Wu, Yaoxiang Wang, Jiacheng Ye, Lingpeng Kong

Behavior Self-Organization Supports Task Inference for Continual Robot Learning

Recent advances in robot learning have enabled robots to become increasingly better at mastering a predefined set of tasks. On the other hand, as humans, we have the ability to learn a growing set of tasks over our lifetime. Continual robot…

Robotics · Computer Science 2021-12-21 Muhammad Burhan Hafez, Stefan Wermter

Prompt-based Context- and Domain-aware Pretraining for Vision and Language Navigation

Pretrained visual-language models have extensive world knowledge and are widely used in visual and language navigation (VLN). However, they are not sensitive to indoor scenarios for VLN tasks. Another challenge for VLN is how the agent…

Computer Vision and Pattern Recognition · Computer Science 2023-12-15 Ting Liu, Yue Hu, Wansen Wu, Youkai Wang, Kai Xu, Quanjun Yin

Logical Specifications-guided Dynamic Task Sampling for Reinforcement Learning Agents

Reinforcement Learning (RL) has made significant strides in enabling artificial agents to learn diverse behaviors. However, learning an effective policy often requires a large number of environment interactions. To mitigate sample…

Artificial Intelligence · Computer Science 2024-04-04 Yash Shukla, Tanushree Burman, Abhishek Kulkarni, Robert Wright, Alvaro Velasquez, Jivko Sinapov

Persistent Rule-based Interactive Reinforcement Learning

Interactive reinforcement learning has allowed speeding up the learning process in autonomous agents by including a human trainer providing extra information to the agent in real-time. Current interactive reinforcement learning research has…

Artificial Intelligence · Computer Science 2021-09-06 Adam Bignold, Francisco Cruz, Richard Dazeley, Peter Vamplew, Cameron Foale

Incremental Self-training for Semi-supervised Learning

Semi-supervised learning provides a solution to reduce the dependency of machine learning on labeled data. As one of the efficient semi-supervised techniques, self-training (ST) has received increasing attention. Several advancements have…

Machine Learning · Computer Science 2024-04-22 Jifeng Guo, Zhulin Liu, Tong Zhang, C. L. Philip Chen

Random Policy Enables In-Context Reinforcement Learning within Trust Horizons

Pretrained foundation models have exhibited extraordinary in-context learning performance, allowing zero-shot generalization to new tasks not encountered during pretraining. In the case of reinforcement learning (RL), in-context RL (ICRL)…

Machine Learning · Computer Science 2025-05-05 Weiqin Chen, Santiago Paternain

LEAD: Iterative Data Selection for Efficient LLM Instruction Tuning

Instruction tuning has emerged as a critical paradigm for improving the capabilities and alignment of large language models (LLMs). However, existing iterative model-aware data selection methods incur significant computational overhead, as…

Machine Learning · Computer Science 2025-05-13 Xiaotian Lin, Yanlin Qi, Yizhang Zhu, Themis Palpanas, Chengliang Chai, Nan Tang, Yuyu Luo

Data Curation Alone Can Stabilize In-context Learning

In-context learning (ICL) enables large language models (LLMs) to perform new tasks by prompting them with a sequence of training examples. However, it is known that ICL is very sensitive to the choice of training examples: randomly…

Computation and Language · Computer Science 2023-09-13 Ting-Yun Chang, Robin Jia

Towards Automatic Continual Learning: A Self-Adaptive Framework for Continual Instruction Tuning

Continual instruction tuning enables large language models (LLMs) to learn incrementally while retaining past knowledge, whereas existing methods primarily focus on how to retain old knowledge rather than on selecting which new knowledge to…

Computation and Language · Computer Science 2025-03-21 Peiyi Lin, Fukai Zhang, Kai Niu, Hao Fu

Addressing Limitations of State-Aware Imitation Learning for Autonomous Driving

Conditional Imitation learning is a common and effective approach to train autonomous driving agents. However, two issues limit the full potential of this approach: (i) the inertia problem, a special case of causal confusion where the agent…

Computer Vision and Pattern Recognition · Computer Science 2023-11-01 Luca Cultrera, Federico Becattini, Lorenzo Seidenari, Pietro Pala, Alberto Del Bimbo

Introspective X Training: Feedback Conditioning Improves Scaling Across all LLM Training Stages

We tackle the question of how to scale more efficiently across the many, ever-growing stages of current LLM training pipelines. Our guiding intuition stems from the fact that the dynamics of later stages of the pipeline, e.g. post-training,…

Machine Learning · Computer Science 2026-05-21 Brandon Cui, Ximing Lu, Jaehun Jung, Syeda Nahida Akter, Hyunwoo Kim, Yuxiao Qu, David Acuna, Shrimai Prabhumoye, Yejin Choi, Prithviraj Ammanabrolu

When Context Sticks: Studying Interference in In-Context Learning

This paper investigates context stickiness in in-context learning (ICL), a phenomenon where earlier examples in a prompt interfere with a transformer's ability to adapt to later tasks. Using synthetic regression tasks over linear and…

Machine Learning · Computer Science 2026-04-28 Hanna Rød, Dagny Streit, Nils Valseth Selte, Justin Li

Tactile Active Inference Reinforcement Learning for Efficient Robotic Manipulation Skill Acquisition

Robotic manipulation holds the potential to replace humans in the execution of tedious or dangerous tasks. However, control-based approaches are not suitable due to the difficulty of formally describing open-world manipulation in reality,…

Robotics · Computer Science 2023-11-21 Zihao Liu, Xing Liu, Yizhai Zhang, Zhengxiong Liu, Panfeng Huang

INST-IT: Boosting Instance Understanding via Explicit Visual Prompt Instruction Tuning

Large Multimodal Models (LMMs) have made significant breakthroughs with the advancement of instruction tuning. However, while existing models can understand images and videos at a holistic level, they still struggle with instance-level…

Computer Vision and Pattern Recognition · Computer Science 2026-03-17 Wujian Peng, Lingchen Meng, Yitong Chen, Yiweng Xie, Yang Liu, Tao Gui, Hang Xu, Xipeng Qiu, Zuxuan Wu, Yu-Gang Jiang

Induction Signatures Are Not Enough: A Matched-Compute Study of Load-Bearing Structure in In-Context Learning

Mechanism-targeted synthetic data is increasingly proposed as a way to steer pretraining toward desirable capabilities, but it remains unclear how such interventions should be evaluated. We study this question for in-context learning (ICL)…

Computation and Language · Computer Science 2026-03-17 Mohammed Sabry, Anya Belz

Learning Action Conditions from Instructional Manuals for Instruction Understanding

The ability to infer pre- and postconditions of an action is vital for comprehending complex instructions, and is essential for applications such as autonomous instruction-guided agents and assistive AI that supports humans to perform…

Computation and Language · Computer Science 2024-07-04 Te-Lin Wu, Caiqi Zhang, Qingyuan Hu, Alex Spangher, Nanyun Peng

Stanceformer: Target-Aware Transformer for Stance Detection

The task of Stance Detection involves discerning the stance expressed in a text towards a specific subject or target. Prior works have relied on existing transformer models that lack the capability to prioritize targets effectively.…

Computation and Language · Computer Science 2024-10-10 Krishna Garg, Cornelia Caragea