Related papers: Interactive Prompt Debugging with Sequence Salienc…

200 papers

In-context learning (ICL) has become one of the most popular learning paradigms. While there is a growing body of literature focusing on prompt engineering, there is a lack of systematic analysis comparing the effects of prompts across…

Computation and Language · Computer Science 2024-06-24 Sheng Lu, Hendrik Schuff, Iryna Gurevych

Saliency methods can make deep neural network predictions more interpretable by identifying a set of critical features in an input sample, such as pixels that contribute most strongly to a prediction made by an image classifier.…

Machine Learning · Computer Science 2021-06-15 Yang Lu, Wenbo Guo, Xinyu Xing, William Stafford Noble

Identifying the salience (i.e. importance) of discourse units is an important task in language understanding. While events play important roles in text documents, little research exists on analyzing their saliency status. This paper…

Computation and Language · Computer Science 2018-09-10 Zhengzhong Liu, Chenyan Xiong, Teruko Mitamura, Eduard Hovy

Continual learning aims to refine model parameters for new tasks while retaining knowledge from previous tasks. Recently, prompt-based learning has emerged to leverage pre-trained models to be prompted to learn subsequent tasks without the…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Jisu Han, Jaemin Na, Wonjun Hwang

A fundamental bottleneck in utilising complex machine learning systems for critical applications has been not knowing why they do and what they do, thus preventing the development of any crucial safety protocols. To date, no method exist…

Machine Learning · Computer Science 2023-01-18 Jan Rosenzweig, Zoran Cvetkovic, Ivana Rosenzweig

Determining and ranking the most salient entities in a text is critical for user-facing systems, especially as users increasingly rely on models to interpret long documents they only partially read. Graded entity salience addresses this…

Computation and Language · Computer Science 2025-06-02 Jessica Lin, Amir Zeldes

Attention mechanism plays a dominant role in the sequence generation models and has been used to improve the performance of machine translation and abstractive text summarization. Different from neural machine translation, in the task of…

Computation and Language · Computer Science 2020-04-09 Piji Li, Lidong Bing, Zhongyu Wei, Wai Lam

Large Language Models (LLMs) excel at text summarization, a task that requires models to select content based on its importance. However, the exact notion of salience that LLMs have internalized remains unclear. To bridge this gap, we…

Computation and Language · Computer Science 2025-05-28 Jan Trienes, Jörg Schlötterer, Junyi Jessy Li, Christin Seifert

Saliency methods are a common class of machine learning interpretability techniques that calculate how important each input feature is to a model's output. We find that, with the rapid pace of development, users struggle to stay informed of…

Machine Learning · Computer Science 2023-06-01 Angie Boggust, Harini Suresh, Hendrik Strobelt, John V. Guttag, Arvind Satyanarayan

Saliency methods have been widely used to highlight important input features in model predictions. Most existing methods use backpropagation on a modified gradient function to generate saliency maps. Thus, noisy gradients can result in…

Computer Vision and Pattern Recognition · Computer Science 2021-11-30 Aya Abdelsalam Ismail, Héctor Corrada Bravo, Soheil Feizi

In this paper, we introduce a strategy for identifying textual saliency in large-scale language models applied to classification tasks. In visual networks where saliency is more well-studied, saliency is naturally localized through the…

Computation and Language · Computer Science 2023-08-11 Elizabeth M. Hou, Gregory Castanon

Salient object detection has seen remarkable progress driven by deep learning techniques. However, most of deep learning based salient object detection methods are black-box in nature and lacking in interpretability. This paper proposes the…

Computer Vision and Pattern Recognition · Computer Science 2017-08-21 Huaxin Xiao, Jiashi Feng, Yunchao Wei, Maojun Zhang

We present Prompt Cache, an approach for accelerating inference for large language models (LLM) by reusing attention states across different LLM prompts. Many input prompts have overlapping text segments, such as system messages, prompt…

Computation and Language · Computer Science 2024-04-26 In Gim, Guojun Chen, Seung-seob Lee, Nikhil Sarda, Anurag Khandelwal, Lin Zhong

Large Language Models (LLMs) are increasingly integrated into real-world applications, from virtual assistants to autonomous agents. However, their flexibility also introduces new attack vectors, particularly Prompt Injection (PI), where…

Cryptography and Security · Computer Science 2025-09-17 Mengxiao Wang, Yuxuan Zhang, Guofei Gu

Recent developments in machine learning have introduced models that approach human performance at the cost of increased architectural complexity. Efforts to make the rationales behind the models' predictions transparent have inspired an…

Computation and Language · Computer Science 2020-09-29 Pepa Atanasova, Jakob Grue Simonsen, Christina Lioma, Isabelle Augenstein

Saliency methods are widely used to interpret neural network predictions, but different variants of saliency methods often disagree even on the interpretations of the same prediction made by the same model. In these cases, how do we…

Computation and Language · Computer Science 2021-04-14 Shuoyang Ding, Philipp Koehn

High-quality saliency maps are essential in several machine learning application areas including explainable AI and weakly supervised object detection and segmentation. Many techniques have been developed to generate better saliency using…

Computer Vision and Pattern Recognition · Computer Science 2022-07-06 Osman Tursun, Simon Denman, Sridha Sridharan, Clinton Fookes

Sequential recommender systems have achieved significant success in modeling temporal user behavior but remain limited in capturing rich user semantics beyond interaction patterns. Large Language Models (LLMs) present opportunities to…

Masked Diffusion Language Models (MDLMs) enable parallel token decoding, providing a promising alternative to the sequential nature of autoregressive generation. However, their iterative denoising process remains computationally expensive…

Computation and Language · Computer Science 2026-03-10 Younjoo Lee, Junghoo Lee, Seungkyun Dan, Jaiyoung Park, Jung Ho Ahn

Vision-language models (VLMs) have achieved remarkable success across diverse tasks. However, concerns about their trustworthiness persist, particularly regarding tendencies to lean more on textual cues than visual evidence and the risk of…

Computer Vision and Pattern Recognition · Computer Science 2026-04-07 Shizhan Gong, Minda Hu, Qiyuan Zhang, Chen Ma, Qi Dou