Related papers: Incorporating Task-specific Concept Knowledge into…

Multimedia Generative Script Learning for Task Planning

Goal-oriented generative script learning aims to generate subsequent steps to reach a particular goal, which is an essential task to assist robots or humans in performing stereotypical activities. An important aspect of this process is the…

Computation and Language · Computer Science 2025-06-11 Qingyun Wang , Manling Li , Hou Pong Chan , Lifu Huang , Julia Hockenmaier , Girish Chowdhary , Heng Ji

Goal-Oriented Script Construction

The knowledge of scripts, common chains of events in stereotypical scenarios, is a valuable asset for task-oriented natural language understanding systems. We propose the Goal-Oriented Script Construction task, where a model produces a…

Computation and Language · Computer Science 2021-09-01 Qing Lyu , Li Zhang , Chris Callison-Burch

Concepts or Skills? Rethinking Instruction Selection for Multi-modal Models

Vision-language instruction tuning achieves two main purposes: learning visual concepts and learning visual skills. In this paper, we found that vision-language benchmarks fall into the dichotomy of mainly benefiting from training on…

Computer Vision and Pattern Recognition · Computer Science 2025-08-15 Andrew Bai , Justin Cui , Ruochen Wang , Cho-Jui Hsieh

Take a Break in the Middle: Investigating Subgoals towards Hierarchical Script Generation

Goal-oriented Script Generation is a new task of generating a list of steps that can fulfill the given goal. In this paper, we propose to extend the task from the perspective of cognitive theory. Instead of a simple flat structure, the…

Computation and Language · Computer Science 2023-05-19 Xinze Li , Yixin Cao , Muhao Chen , Aixin Sun

CSSL-MHTR: Continual Self-Supervised Learning for Scalable Multi-script Handwritten Text Recognition

Self-supervised learning has recently emerged as a strong alternative in document analysis. These approaches are now capable of learning high-quality image representations and overcoming the limitations of supervised methods, which require…

Computer Vision and Pattern Recognition · Computer Science 2024-04-30 Marwa Dhiaf , Mohamed Ali Souibgui , Kai Wang , Yuyang Liu , Yousri Kessentini , Alicia Fornés , Ahmed Cheikh Rouhou

An Error-Oriented Approach to Word Embedding Pre-Training

We propose a novel word embedding pre-training approach that exploits writing errors in learners' scripts. We compare our method to previous models that tune the embeddings based on script scores and the discrimination between correct and…

Computation and Language · Computer Science 2019-07-05 Youmna Farag , Marek Rei , Ted Briscoe

ProTPS: Prototype-Guided Text Prompt Selection for Continual Learning

For continual learning, text-prompt-based methods leverage text encoders and learnable prompts to encode semantic features for sequentially arrived classes over time. A common challenge encountered by existing works is how to learn unique…

Computer Vision and Pattern Recognition · Computer Science 2026-04-02 Jie Mei , Li-Leng Peng , Keith Fuller , Jenq-Neng Hwang

Aligning Script Events with Narrative Texts

Script knowledge plays a central role in text understanding and is relevant for a variety of downstream tasks. In this paper, we consider two recent datasets which provide a rich and general representation of script events in terms of…

Computation and Language · Computer Science 2019-05-21 Simon Ostermann , Michael Roth , Stefan Thater , Manfred Pinkal

Tracking with Human-Intent Reasoning

Advances in perception modeling have significantly improved the performance of object tracking. However, the current methods for specifying the target object in the initial frame are either by 1) using a box or mask template, or by 2)…

Computer Vision and Pattern Recognition · Computer Science 2024-01-01 Jiawen Zhu , Zhi-Qi Cheng , Jun-Yan He , Chenyang Li , Bin Luo , Huchuan Lu , Yifeng Geng , Xuansong Xie

Semantic Prompting with Image-Token for Continual Learning

Continual learning aims to refine model parameters for new tasks while retaining knowledge from previous tasks. Recently, prompt-based learning has emerged to leverage pre-trained models to be prompted to learn subsequent tasks without the…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Jisu Han , Jaemin Na , Wonjun Hwang

A Generative Approach for Script Event Prediction via Contrastive Fine-tuning

Script event prediction aims to predict the subsequent event given the context. This requires the capability to infer the correlations between events. Recent works have attempted to improve event correlation reasoning by using pretrained…

Computation and Language · Computer Science 2022-12-12 Fangqi Zhu , Jun Gao , Changlong Yu , Wei Wang , Chen Xu , Xin Mu , Min Yang , Ruifeng Xu

Separate Scene Text Detector for Unseen Scripts is Not All You Need

Text detection in the wild is a well-known problem that becomes more challenging while handling multiple scripts. In the last decade, some scripts have gained the attention of the research community and achieved good detection performance.…

Computer Vision and Pattern Recognition · Computer Science 2023-08-01 Prateek Keserwani , Taveena Lotey , Rohit Keshari , Partha Pratim Roy

Learning to Learn with Contrastive Meta-Objective

Meta-learning enables learning systems to adapt quickly to new tasks, similar to humans. Different meta-learning approaches all work under/with the mini-batch episodic training framework. Such framework naturally gives the information about…

Machine Learning · Computer Science 2025-11-10 Shiguang Wu , Yaqing Wang , Yatao Bian , Quanming Yao

Identifying User Goals from UI Trajectories

Identifying underlying user goals and intents has been recognized as valuable in various personalization-oriented settings, such as personalized agents, improved search responses, advertising, user analytics, and more. In this paper, we…

Computation and Language · Computer Science 2025-03-04 Omri Berkovitch , Sapir Caduri , Noam Kahlon , Anatoly Efros , Avi Caciularu , Ido Dagan

Pre-training Text Representations as Meta Learning

Pre-training text representations has recently been shown to significantly improve the state-of-the-art in many natural language processing tasks. The central goal of pre-training is to learn text representations that are useful for…

Computation and Language · Computer Science 2020-04-14 Shangwen Lv , Yuechen Wang , Daya Guo , Duyu Tang , Nan Duan , Fuqing Zhu , Ming Gong , Linjun Shou , Ryan Ma , Daxin Jiang , Guihong Cao , Ming Zhou , Songlin Hu

Contrastive Multi-Task Dense Prediction

This paper targets the problem of multi-task dense prediction which aims to achieve simultaneous learning and inference on a bunch of multiple dense prediction tasks in a single framework. A core objective in design is how to effectively…

Computer Vision and Pattern Recognition · Computer Science 2023-07-18 Siwei Yang , Hanrong Ye , Dan Xu

More Than Just Attention: Improving Cross-Modal Attentions with Contrastive Constraints for Image-Text Matching

Cross-modal attention mechanisms have been widely applied to the image-text matching task and have achieved remarkable improvements thanks to its capability of learning fine-grained relevance across different modalities. However, the…

Computer Vision and Pattern Recognition · Computer Science 2022-10-05 Yuxiao Chen , Jianbo Yuan , Long Zhao , Tianlang Chen , Rui Luo , Larry Davis , Dimitris N. Metaxas

Better Captioning with Sequence-Level Exploration

Sequence-level learning objective has been widely used in captioning tasks to achieve the state-of-the-art performance for many models. In this objective, the model is trained by the reward on the quality of its generated captions…

Computer Vision and Pattern Recognition · Computer Science 2020-03-10 Jia Chen , Qin Jin

A Composable Specification Language for Reinforcement Learning Tasks

Reinforcement learning is a promising approach for learning control policies for robot tasks. However, specifying complex tasks (e.g., with multiple objectives and safety constraints) can be challenging, since the user must design a reward…

Machine Learning · Computer Science 2020-10-30 Kishor Jothimurugan , Rajeev Alur , Osbert Bastani

Towards Goal-oriented Intelligent Tutoring Systems in Online Education

Interactive Intelligent Tutoring Systems (ITSs) enhance traditional ITSs by promoting effective learning through interactions and problem resolution in online education. Yet, proactive engagement, prioritizing resource optimization with…

Computers and Society · Computer Science 2025-08-25 Yang Deng , Zifeng Ren , An Zhang , Tat-Seng Chua