Related papers: STAND: Self-Aware Precondition Induction for Inter…

Explicitly Modeling the Discriminability for Instance-Aware Visual Object Tracking

Visual object tracking performance has been dramatically improved in recent years, but some severe challenges remain open, like distractors and occlusions. We suspect the reason is that the feature representations of the tracking targets…

Computer Vision and Pattern Recognition · Computer Science 2021-10-29 Mengmeng Wang, Xiaoqian Yang, Yong Liu

iTAML: An Incremental Task-Agnostic Meta-learning Approach

Humans can continuously learn new knowledge as their experience grows. In contrast, previous learning in deep neural networks can quickly fade out when they are trained on a new task. In this paper, we hypothesize this problem can be…

Machine Learning · Computer Science 2020-03-27 Jathushan Rajasegaran, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Mubarak Shah

ToolSelf: Unifying Task Execution and Self-Reconfiguration via Tool-Driven Intrinsic Adaptation

Agentic systems powered by Large Language Models (LLMs) have demonstrated remarkable potential in tackling complex, long-horizon tasks. However, their efficacy is fundamentally constrained by static configurations governing agent behaviors,…

Artificial Intelligence · Computer Science 2026-02-24 Jingqi Zhou, Sheng Wang, DeZhao Deng, Junwen Lu, Junwei Su, Qintong Li, Jiahui Gao, Hao Wu, Jiyue Jiang, Lingpeng Kong, Chuan Wu

AI Agents as Universal Task Solvers

We describe AI agents as stochastic dynamical systems and frame the problem of learning to reason as in transductive inference: Rather than approximating the distribution of past data as in classical induction, the objective is to capture…

Artificial Intelligence · Computer Science 2026-02-24 Alessandro Achille, Stefano Soatto

ATLAS: Adaptive Test-Time Latent Steering with External Verifiers for Enhancing LLMs Reasoning

Recent work on activation and latent steering has demonstrated that modifying internal representations can effectively guide large language models (LLMs) toward improved reasoning and efficiency without additional training. However, most…

Machine Learning · Computer Science 2026-01-07 Tuc Nguyen, Thai Le

Learning Correct Behavior from Examples: Validating Sequential Execution in Autonomous Agents

As autonomous agents become increasingly sophisticated, validating their sequential behavior presents a significant challenge. Traditional testing approaches require manual specification, exact sequence matching, or thousands of training…

Artificial Intelligence · Computer Science 2026-05-06 Reshabh K Sharma, Gaurav Mittal, Yu Hu

IN-RIL: Interleaved Reinforcement and Imitation Learning for Policy Fine-Tuning

Imitation learning (IL) and reinforcement learning (RL) each offer distinct advantages for robotics policy learning: IL provides stable learning from demonstrations, and RL promotes generalization through exploration. While existing robot…

Robotics · Computer Science 2025-05-16 Dechen Gao, Hang Wang, Hanchu Zhou, Nejib Ammar, Shatadal Mishra, Ahmadreza Moradipari, Iman Soltani, Junshan Zhang

Implicit Context-aware Learning and Discovery for Streaming Data Analytics

The performance of a machine learning model can be further improved if contextual cues are provided as input along with base features that are directly related to an inference task. In offline learning, one can inspect historical training…

Machine Learning · Computer Science 2019-10-21 Kin Gwn Lore, Kishore K. Reddy

Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments

Imitation learning (IL) enables agents to mimic expert behaviors. Most previous IL techniques focus on precisely imitating one policy through mass demonstrations. However, in many applications, what humans require is the ability to perform…

Machine Learning · Computer Science 2023-10-10 Xiong-Hui Chen, Junyin Ye, Hang Zhao, Yi-Chen Li, Haoran Shi, Yu-Yan Xu, Zhihao Ye, Si-Hang Yang, Anqi Huang, Kai Xu, Zongzhang Zhang, Yang Yu

Learning Manipulation under Physics Constraints with Visual Perception

Understanding physical phenomena is a key competence that enables humans and animals to act and interact under uncertain perception in previously unseen environments containing novel objects and their configurations. In this work, we…

Robotics · Computer Science 2019-04-23 Wenbin Li, Aleš Leonardis, Jeannette Bohg, Mario Fritz

Position Prediction as an Effective Pretraining Strategy

Transformers have gained increasing popularity in a wide range of applications, including Natural Language Processing (NLP), Computer Vision and Speech Recognition, because of their powerful representational capacity. However, harnessing…

Machine Learning · Computer Science 2022-07-18 Shuangfei Zhai, Navdeep Jaitly, Jason Ramapuram, Dan Busbridge, Tatiana Likhomanenko, Joseph Yitan Cheng, Walter Talbott, Chen Huang, Hanlin Goh, Joshua Susskind

Autonomous Question Formation for Large Language Model-Driven AI Systems

Large language model (LLM)-driven AI systems are increasingly important for autonomous decision-making in dynamic and open environments. However, most existing systems rely on predefined tasks and fixed prompts, limiting their ability to…

Artificial Intelligence · Computer Science 2026-02-03 Hong Su

STELLA: Self-Evolving LLM Agent for Biomedical Research

The rapid growth of biomedical data, tools, and literature has created a fragmented research landscape that outpaces human expertise. While AI agents offer a solution, they typically rely on static, manually curated toolsets, limiting their…

Artificial Intelligence · Computer Science 2025-07-04 Ruofan Jin, Zaixi Zhang, Mengdi Wang, Le Cong

In-Place Test-Time Training

The static "train then deploy" paradigm fundamentally limits Large Language Models (LLMs) from dynamically adapting their weights in response to continuous streams of new information inherent in real-world tasks. Test-Time Training (TTT)…

Machine Learning · Computer Science 2026-04-08 Guhao Feng, Shengjie Luo, Kai Hua, Ge Zhang, Di He, Wenhao Huang, Tianle Cai

Teaching-Assistant-in-the-Loop: Improving Knowledge Distillation from Imperfect Teacher Models in Low-Budget Scenarios

There is increasing interest in distilling task-specific knowledge from large language models (LLM) to smaller student models. Nonetheless, LLM distillation presents a dual challenge: 1) there is a high cost associated with querying the…

Computation and Language · Computer Science 2024-06-11 Yuhang Zhou, Wei Ai

Becoming self-instruct: introducing early stopping criteria for minimal instruct tuning

In this paper, we introduce the Instruction Following Score (IFS), a metric that detects language models' ability to follow instructions. The metric has a dual purpose. First, IFS can be used to distinguish between base and instruct models.…

Computation and Language · Computer Science 2023-07-10 Waseem AlShikh, Manhal Daaboul, Kirk Goddard, Brock Imel, Kiran Kamble, Parikshith Kulkarni, Melisa Russak

Reinforcement Learning without Ground-Truth State

To perform robot manipulation tasks, a low-dimensional state of the environment typically needs to be estimated. However, designing a state estimator can sometimes be difficult, especially in environments with deformable objects. An…

Robotics · Computer Science 2019-07-16 Xingyu Lin, Harjatin Singh Baweja, David Held

Temporal DINO: A Self-supervised Video Strategy to Enhance Action Prediction

The emerging field of action prediction plays a vital role in various computer vision applications such as autonomous driving, activity analysis and human-computer interaction. Despite significant advancements, accurately predicting future…

Computer Vision and Pattern Recognition · Computer Science 2023-08-22 Izzeddin Teeti, Rongali Sai Bhargav, Vivek Singh, Andrew Bradley, Biplab Banerjee, Fabio Cuzzolin

On the Feasibility of Learning, Rather than Assuming, Human Biases for Reward Inference

Our goal is for agents to optimize the right reward function, despite how difficult it is for us to specify what that is. Inverse Reinforcement Learning (IRL) enables us to infer reward functions from demonstrations, but it usually assumes…

Machine Learning · Computer Science 2019-06-25 Rohin Shah, Noah Gundotra, Pieter Abbeel, Anca D. Dragan

Self-Supervised Learning of Action Affordances as Interaction Modes

When humans perform a task with an articulated object, they interact with the object only in a handful of ways, while the space of all possible interactions is nearly endless. This is because humans have prior knowledge about what…

Computer Vision and Pattern Recognition · Computer Science 2023-05-30 Liquan Wang, Nikita Dvornik, Rafael Dubeau, Mayank Mittal, Animesh Garg