Related papers: STAND: Self-Aware Precondition Induction for Inter…

LLMs Could Autonomously Learn Without External Supervision

In the quest for super-human performance, Large Language Models (LLMs) have traditionally been tethered to human-annotated datasets and predefined training objectives-a process that is both labor-intensive and inherently limited. This paper…

Computation and Language · Computer Science 2024-06-10 Ke Ji , Junying Chen , Anningzhe Gao , Wenya Xie , Xiang Wan , Benyou Wang

PT4AL: Using Self-Supervised Pretext Tasks for Active Learning

Labeling a large set of data is expensive. Active learning aims to tackle this problem by asking to annotate only the most informative data from the unlabeled set. We propose a novel active learning approach that utilizes self-supervised…

Computer Vision and Pattern Recognition · Computer Science 2022-07-27 John Seon Keun Yi , Minseok Seo , Jongchan Park , Dong-Geol Choi

Transformers as Algorithms: Generalization and Stability in In-context Learning

In-context learning (ICL) is a type of prompting where a transformer model operates on a sequence of (input, output) examples and performs inference on-the-fly. In this work, we formalize in-context learning as an algorithm learning problem…

Machine Learning · Computer Science 2023-02-07 Yingcong Li , M. Emrullah Ildiz , Dimitris Papailiopoulos , Samet Oymak

Next-token pretraining implies in-context learning

We argue that in-context learning (ICL) predictably arises from standard self-supervised next-token pretraining, rather than being an exotic emergent property. This work establishes the foundational principles of this emergence by focusing…

Machine Learning · Computer Science 2025-07-15 Paul M. Riechers , Henry R. Bigelow , Eric A. Alt , Adam Shai

Self-Training for Sample-Efficient Active Learning for Text Classification with Pre-Trained Language Models

Active learning is an iterative labeling process that is used to obtain a small labeled subset, despite the absence of labeled data, thereby enabling to train a model for supervised tasks such as text classification. While active learning…

Computation and Language · Computer Science 2024-10-07 Christopher Schröder , Gerhard Heyer

DETAIL: Task DEmonsTration Attribution for Interpretable In-context Learning

In-context learning (ICL) allows transformer-based language models that are pre-trained on general text to quickly learn a specific task with a few "task demonstrations" without updating their parameters, significantly boosting their…

Computation and Language · Computer Science 2024-12-17 Zijian Zhou , Xiaoqiang Lin , Xinyi Xu , Alok Prakash , Daniela Rus , Bryan Kian Hsiang Low

Stance Detection Benchmark: How Robust Is Your Stance Detection?

Stance Detection (StD) aims to detect an author's stance towards a certain topic or claim and has become a key component in applications like fake news detection, claim validation, and argument search. However, while stance is easily…

Computation and Language · Computer Science 2020-01-07 Benjamin Schiller , Johannes Daxenberger , Iryna Gurevych

On the Inductive Bias of Stacking Towards Improving Reasoning

Given the increasing scale of model sizes, novel training strategies like gradual stacking [Gong et al., 2019, Reddi et al., 2023] have garnered interest. Stacking enables efficient training by gradually growing the depth of a model in…

Computation and Language · Computer Science 2024-10-01 Nikunj Saunshi , Stefani Karp , Shankar Krishnan , Sobhan Miryoosefi , Sashank J. Reddi , Sanjiv Kumar

InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation

To operate effectively in the real world, robots should integrate multimodal reasoning with precise action generation. However, existing vision-language-action (VLA) models often sacrifice one for the other, narrow their abilities to…

Robotics · Computer Science 2026-03-04 Shuai Yang , Hao Li , Bin Wang , Yilun Chen , Yang Tian , Tai Wang , Hanqing Wang , Feng Zhao , Yiyi Liao , Jiangmiao Pang

Mechanistic Fine-tuning for In-context Learning

In-context Learning (ICL) utilizes structured demonstration-query inputs to induce few-shot learning on Language Models (LMs), which are not originally pre-trained on ICL-style data. To bridge the gap between ICL and pre-training, some…

Computation and Language · Computer Science 2025-09-30 Hakaze Cho , Peng Luo , Mariko Kato , Rin Kaenbyou , Naoya Inoue

LLM-based Interactive Imitation Learning for Robotic Manipulation

Recent advancements in machine learning provide methods to train autonomous agents capable of handling the increasing complexity of sequential decision-making in robotics. Imitation Learning (IL) is a prominent approach, where agents learn…

Robotics · Computer Science 2025-05-01 Jonas Werner , Kun Chu , Cornelius Weber , Stefan Wermter

See, Think, Act: Teaching Multimodal Agents to Effectively Interact with GUI by Identifying Toggles

The advent of multimodal agents facilitates effective interaction within graphical user interface (GUI), especially in ubiquitous GUI control. However, their inability to reliably execute toggle control instructions remains a key…

Artificial Intelligence · Computer Science 2026-03-19 Zongru Wu , Rui Mao , Zhiyuan Tian , Pengzhou Cheng , Tianjie Ju , Zheng Wu , Lingzhong Dong , Haiyue Sheng , Zhuosheng Zhang , Gongshen Liu

Online Continual Learning For Interactive Instruction Following Agents

In learning an embodied agent executing daily tasks via language directives, the literature largely assumes that the agent learns all training data at the beginning. We argue that such a learning scenario is less realistic since a robotic…

Artificial Intelligence · Computer Science 2024-03-14 Byeonghwi Kim , Minhyuk Seo , Jonghyun Choi

Instance-Aware Predictive Navigation in Multi-Agent Environments

In this work, we aim to achieve efficient end-to-end learning of driving policies in dynamic multi-agent environments. Predicting and anticipating future events at the object level are critical for making informed driving decisions. We…

Robotics · Computer Science 2021-01-18 Jinkun Cao , Xin Wang , Trevor Darrell , Fisher Yu

SAND: Boosting LLM Agents with Self-Taught Action Deliberation

Large Language Model (LLM) agents are commonly tuned with supervised finetuning on ReAct-style expert trajectories or preference optimization over pairwise rollouts. Most of these methods focus on imitating specific expert behaviors or…

Computation and Language · Computer Science 2025-08-22 Yu Xia , Yiran Shen , Junda Wu , Tong Yu , Sungchul Kim , Ryan A. Rossi , Lina Yao , Julian McAuley

Skill-Targeted Adaptive Training

Language models often show little to no improvement (i.e., "saturation") when trained via vanilla supervised fine-tuning (SFT) on data similar to what they saw in their training set (e.g., MATH). We introduce a new fine-tuning strategy,…

Machine Learning · Computer Science 2025-10-14 Yinghui He , Abhishek Panigrahi , Yong Lin , Sanjeev Arora

Decision Boundary-aware Knowledge Consolidation Generates Better Instance-Incremental Learner

Instance-incremental learning (IIL) focuses on learning continually with data of the same classes. Compared to class-incremental learning (CIL), the IIL is seldom explored because IIL suffers less from catastrophic forgetting (CF). However,…

Machine Learning · Computer Science 2024-06-06 Qiang Nie , Weifu Fu , Yuhuan Lin , Jialin Li , Yifeng Zhou , Yong Liu , Lei Zhu , Chengjie Wang

Predicting Human Behavior in Autonomous Systems: A Collaborative Machine Teaching Approach for Reducing Transfer of Control Events

As autonomous systems become integral to various industries, effective strategies for fault handling are essential to ensure reliability and efficiency. Transfer of Control (ToC), a traditional approach for interrupting automated processes…

Robotics · Computer Science 2025-05-19 Julian Wolter , Amr Gomaa

Instance-dependent Early Stopping

In machine learning practice, early stopping has been widely used to regularize models and can save computational costs by halting the training process when the model's performance on a validation set stops improving. However, conventional…

Machine Learning · Computer Science 2025-02-12 Suqin Yuan , Runqi Lin , Lei Feng , Bo Han , Tongliang Liu

Instruction-Guided Scene Text Recognition

Multi-modal models have shown appealing performance in visual recognition tasks, as free-form text-guided training evokes the ability to understand fine-grained visual content. However, current models cannot be trivially applied to scene…

Computer Vision and Pattern Recognition · Computer Science 2025-01-03 Yongkun Du , Zhineng Chen , Yuchen Su , Caiyan Jia , Yu-Gang Jiang