English
Related papers

Related papers: STAND: Self-Aware Precondition Induction for Inter…

200 papers

Prior work has established Test-Time Training (TTT) as a general framework to further improve a trained model at test time. Before making a prediction on each test instance, the model is first trained on the same instance using a…

Computer Vision and Pattern Recognition · Computer Science 2025-01-07 Renhao Wang , Yu Sun , Arnuv Tandon , Yossi Gandelsman , Xinlei Chen , Alexei A. Efros , Xiaolong Wang

Real-world visual data rarely presents as isolated, static instances. Instead, it often evolves gradually over time through variations in pose, lighting, object state, or scene context. However, conventional classifiers are typically…

Computer Vision and Pattern Recognition · Computer Science 2025-11-18 Xi Ding , Lei Wang , Piotr Koniusz , Yongsheng Gao

Multi-agent settings are quickly gathering importance in machine learning. This includes a plethora of recent work on deep multi-agent reinforcement learning, but also can be extended to hierarchical RL, generative adversarial networks and…

Artificial Intelligence · Computer Science 2018-09-21 Jakob N. Foerster , Richard Y. Chen , Maruan Al-Shedivat , Shimon Whiteson , Pieter Abbeel , Igor Mordatch

Learning is an inherently continuous phenomenon. When humans learn a new task there is no explicit distinction between training and inference. As we learn a task, we keep learning about it while performing the task. What we learn and how we…

Computer Vision and Pattern Recognition · Computer Science 2019-03-28 Mitchell Wortsman , Kiana Ehsani , Mohammad Rastegari , Ali Farhadi , Roozbeh Mottaghi

Large language models (LLMs) have demonstrated high performance on tasks expressed in natural language, particularly in zero- or few-shot settings. These are typically framed as supervised (e.g., classification) or unsupervised (e.g.,…

Computation and Language · Computer Science 2026-02-27 Yarik Menchaca Resendiz , Roman Klinger

Offline Reinforcement Learning (RL) addresses the problem of sequential decision-making by learning optimal policy through pre-collected data, without interacting with the environment. As yet, it has remained somewhat impractical, because…

Machine Learning · Computer Science 2024-10-07 Maksim Bobrin , Nazar Buzun , Dmitrii Krylov , Dmitry V. Dylov

Inverse reinforcement learning (IRL) is the problem of learning the preferences of an agent from the observations of its behavior on a task. While this problem has been well investigated, the related problem of {\em online} IRL---where the…

Machine Learning · Computer Science 2020-11-19 Saurabh Arora , Prashant Doshi , Bikramjit Banerjee

Large-scale models trained on extensive datasets, have emerged as the preferred approach due to their high generalizability across various tasks. In-context learning (ICL), a popular strategy in natural language processing, uses such models…

Computer Vision and Pattern Recognition · Computer Science 2023-11-08 Jiahao Zhang , Bowen Wang , Liangzhi Li , Yuta Nakashima , Hajime Nagahara

Endowing robots with the human ability to learn a growing set of skills over the course of a lifetime as opposed to mastering single tasks is an open problem in robot learning. While multi-task learning approaches have been proposed to…

Robotics · Computer Science 2023-09-19 Muhammad Burhan Hafez , Stefan Wermter

Comprehending natural language and following human instructions are critical capabilities for intelligent agents. However, the flexibility of linguistic instructions induces substantial ambiguity across language-conditioned tasks, severely…

Artificial Intelligence · Computer Science 2025-10-24 Runpeng Xie , Quanwei Wang , Hao Hu , Zherui Zhou , Ni Mu , Xiyun Li , Yiqin Yang , Shuang Xu , Qianchuan Zhao , Bo XU

For AI systems to be useful to humans, they must understand and act in accordance with our values and preferences. Since specifying preferences is a hard task, inverse reinforcement learning (IRL) aims to develop methods that allow for…

Artificial Intelligence · Computer Science 2026-05-12 Karim Abdel Sadek , Mark Bedaywi , Rhys Gould , Stuart Russell

We seek to align agent policy with human expert behavior in a reinforcement learning (RL) setting, without any prior knowledge about dynamics, reward function, and unsafe states. There is a human expert knowing the rewards and unsafe states…

Machine Learning · Computer Science 2020-01-01 Daniel Hsu

In this paper, we study the problem of obtaining a control policy that can mimic and then outperform expert demonstrations in Markov decision processes where the reward function is unknown to the learning agent. One main relevant approach…

Machine Learning · Computer Science 2020-09-24 Feng Tao , Yongcan Cao

Learning in multi-agent environments is difficult due to the non-stationarity introduced by an opponent's or partner's changing behaviors. Instead of reactively adapting to the other agent's (opponent or partner) behavior, we propose an…

Robotics · Computer Science 2021-10-18 Woodrow Z. Wang , Andy Shih , Annie Xie , Dorsa Sadigh

As online news consumption grows, personalized recommendation systems have become integral to digital journalism. However, these systems risk reinforcing filter bubbles and political polarization by failing to incorporate diverse…

Computation and Language · Computer Science 2025-09-23 Dahyun Lee , Jonghyeon Choi , Jiyoung Han , Kunwoo Park

While the identification of nonlinear dynamical systems is a fundamental building block of model-based reinforcement learning and feedback control, its sample complexity is only understood for systems that either have discrete states and…

Machine Learning · Statistics 2020-06-19 Horia Mania , Michael I. Jordan , Benjamin Recht

Reinforcement learning (RL) has demonstrated potential in enhancing the reasoning capabilities of large language models (LLMs), but such training typically demands substantial efforts in creating and annotating data. In this work, we…

Computation and Language · Computer Science 2025-10-06 Hangfan Zhang , Siyuan Xu , Zhimeng Guo , Huaisheng Zhu , Shicheng Liu , Xinrun Wang , Qiaosheng Zhang , Yang Chen , Peng Ye , Lei Bai , Shuyue Hu

Large language models (LLMs) have acquired the ability to solve general tasks by utilizing instruction finetuning (IFT). However, IFT still relies heavily on instance training of extensive task data, which greatly limits the adaptability of…

Computation and Language · Computer Science 2025-02-19 Huanxuan Liao , Shizhu He , Yao Xu , Yuanzhe Zhang , Yanchao Hao , Shengping Liu , Kang Liu , Jun Zhao

There is a growing interest in the community in making an embodied AI agent perform a complicated task while interacting with an environment following natural language directives. Recent studies have tackled the problem using ALFRED, a…

Computer Vision and Pattern Recognition · Computer Science 2021-06-08 Van-Quang Nguyen , Masanori Suganuma , Takayuki Okatani

Test-time training (TTT) enhances model performance by explicitly updating designated parameters prior to each prediction to adapt to the test data. While TTT has demonstrated considerable empirical success, its theoretical underpinnings…

Machine Learning · Statistics 2026-02-03 Kento Kuwataka , Taiji Suzuki