Related papers: STAND: Self-Aware Precondition Induction for Inter…

Test-Time Training on Video Streams

Prior work has established Test-Time Training (TTT) as a general framework to further improve a trained model at test time. Before making a prediction on each test instance, the model is first trained on the same instance using a…

Computer Vision and Pattern Recognition · Computer Science 2025-01-07 Renhao Wang, Yu Sun, Arnuv Tandon, Yossi Gandelsman, Xinlei Chen, Alexei A. Efros, Xiaolong Wang

Learning Time in Static Classifiers

Real-world visual data rarely presents as isolated, static instances. Instead, it often evolves gradually over time through variations in pose, lighting, object state, or scene context. However, conventional classifiers are typically…

Computer Vision and Pattern Recognition · Computer Science 2025-11-18 Xi Ding, Lei Wang, Piotr Koniusz, Yongsheng Gao

Learning with Opponent-Learning Awareness

Multi-agent settings are quickly gathering importance in machine learning. This includes a plethora of recent work on deep multi-agent reinforcement learning, but also can be extended to hierarchical RL, generative adversarial networks and…

Artificial Intelligence · Computer Science 2018-09-21 Jakob N. Foerster, Richard Y. Chen, Maruan Al-Shedivat, Shimon Whiteson, Pieter Abbeel, Igor Mordatch

Learning to Learn How to Learn: Self-Adaptive Visual Navigation Using Meta-Learning

Learning is an inherently continuous phenomenon. When humans learn a new task there is no explicit distinction between training and inference. As we learn a task, we keep learning about it while performing the task. What we learn and how we…

Computer Vision and Pattern Recognition · Computer Science 2019-03-28 Mitchell Wortsman, Kiana Ehsani, Mohammad Rastegari, Ali Farhadi, Roozbeh Mottaghi

PARL: Prompt-based Agents for Reinforcement Learning

Large language models (LLMs) have demonstrated high performance on tasks expressed in natural language, particularly in zero- or few-shot settings. These are typically framed as supervised (e.g., classification) or unsupervised (e.g.,…

Computation and Language · Computer Science 2026-02-27 Yarik Menchaca Resendiz, Roman Klinger

Align Your Intents: Offline Imitation Learning via Optimal Transport

Offline Reinforcement Learning (RL) addresses the problem of sequential decision-making by learning an optimal policy from pre-collected data, without interacting with the environment. As yet, it has remained somewhat impractical, because…

Machine Learning · Computer Science 2024-10-07 Maksim Bobrin, Nazar Buzun, Dmitrii Krylov, Dmitry V. Dylov

A Framework and Method for Online Inverse Reinforcement Learning

Inverse reinforcement learning (IRL) is the problem of learning the preferences of an agent from the observations of its behavior on a task. While this problem has been well investigated, the related problem of online IRL, where the…

Machine Learning · Computer Science 2020-11-19 Saurabh Arora, Prashant Doshi, Bikramjit Banerjee

Instruct Me More! Random Prompting for Visual In-Context Learning

Large-scale models trained on extensive datasets have emerged as the preferred approach due to their high generalizability across various tasks. In-context learning (ICL), a popular strategy in natural language processing, uses such models…

Computer Vision and Pattern Recognition · Computer Science 2023-11-08 Jiahao Zhang, Bowen Wang, Liangzhi Li, Yuta Nakashima, Hajime Nagahara

Continual Robot Learning using Self-Supervised Task Inference

Endowing robots with the human ability to learn a growing set of skills over the course of a lifetime as opposed to mastering single tasks is an open problem in robot learning. While multi-task learning approaches have been proposed to…

Robotics · Computer Science 2023-09-19 Muhammad Burhan Hafez, Stefan Wermter

DAIL: Beyond Task Ambiguity for Language-Conditioned Reinforcement Learning

Comprehending natural language and following human instructions are critical capabilities for intelligent agents. However, the flexibility of linguistic instructions induces substantial ambiguity across language-conditioned tasks, severely…

Artificial Intelligence · Computer Science 2025-10-24 Runpeng Xie, Quanwei Wang, Hao Hu, Zherui Zhou, Ni Mu, Xiyun Li, Yiqin Yang, Shuang Xu, Qianchuan Zhao, Bo XU

Learning the Preferences of a Learning Agent

For AI systems to be useful to humans, they must understand and act in accordance with our values and preferences. Since specifying preferences is a hard task, inverse reinforcement learning (IRL) aims to develop methods that allow for…

Artificial Intelligence · Computer Science 2026-05-12 Karim Abdel Sadek, Mark Bedaywi, Rhys Gould, Stuart Russell

A New Framework for Query Efficient Active Imitation Learning

We seek to align agent policy with human expert behavior in a reinforcement learning (RL) setting, without any prior knowledge about dynamics, reward function, and unsafe states. There is a human expert knowing the rewards and unsafe states…

Machine Learning · Computer Science 2020-01-01 Daniel Hsu

Learn to Exceed: Stereo Inverse Reinforcement Learning with Concurrent Policy Optimization

In this paper, we study the problem of obtaining a control policy that can mimic and then outperform expert demonstrations in Markov decision processes where the reward function is unknown to the learning agent. One main relevant approach…

Machine Learning · Computer Science 2020-09-24 Feng Tao, Yongcan Cao

Influencing Towards Stable Multi-Agent Interactions

Learning in multi-agent environments is difficult due to the non-stationarity introduced by an opponent's or partner's changing behaviors. Instead of reactively adapting to the other agent's (opponent or partner) behavior, we propose an…

Robotics · Computer Science 2021-10-18 Woodrow Z. Wang, Andy Shih, Annie Xie, Dorsa Sadigh

Journalism-Guided Agentic In-Context Learning for News Stance Detection

As online news consumption grows, personalized recommendation systems have become integral to digital journalism. However, these systems risk reinforcing filter bubbles and political polarization by failing to incorporate diverse…

Computation and Language · Computer Science 2025-09-23 Dahyun Lee, Jonghyeon Choi, Jiyoung Han, Kunwoo Park

Active Learning for Nonlinear System Identification with Guarantees

While the identification of nonlinear dynamical systems is a fundamental building block of model-based reinforcement learning and feedback control, its sample complexity is only understood for systems that either have discrete states and…

Machine Learning · Statistics 2020-06-19 Horia Mania, Michael I. Jordan, Benjamin Recht

The Path of Self-Evolving Large Language Models: Achieving Data-Efficient Learning via Intrinsic Feedback

Reinforcement learning (RL) has demonstrated potential in enhancing the reasoning capabilities of large language models (LLMs), but such training typically demands substantial efforts in creating and annotating data. In this work, we…

Computation and Language · Computer Science 2025-10-06 Hangfan Zhang, Siyuan Xu, Zhimeng Guo, Huaisheng Zhu, Shicheng Liu, Xinrun Wang, Qiaosheng Zhang, Yang Chen, Peng Ye, Lei Bai, Shuyue Hu

From Instance Training to Instruction Learning: Task Adapters Generation from Instructions

Large language models (LLMs) have acquired the ability to solve general tasks by utilizing instruction finetuning (IFT). However, IFT still relies heavily on instance training of extensive task data, which greatly limits the adaptability of…

Computation and Language · Computer Science 2025-02-19 Huanxuan Liao, Shizhu He, Yao Xu, Yuanzhe Zhang, Yanchao Hao, Shengping Liu, Kang Liu, Jun Zhao

Look Wide and Interpret Twice: Improving Performance on Interactive Instruction-following Tasks

There is a growing interest in the community in making an embodied AI agent perform a complicated task while interacting with an environment following natural language directives. Recent studies have tackled the problem using ALFRED, a…

Computer Vision and Pattern Recognition · Computer Science 2021-06-08 Van-Quang Nguyen, Masanori Suganuma, Takayuki Okatani

Test time training enhances in-context learning of nonlinear functions

Test-time training (TTT) enhances model performance by explicitly updating designated parameters prior to each prediction to adapt to the test data. While TTT has demonstrated considerable empirical success, its theoretical underpinnings…

Machine Learning · Statistics 2026-02-03 Kento Kuwataka, Taiji Suzuki