Related papers: Consecutive Batch Model Editing with HooK Layers

200 papers

The imperative task of revising or updating the knowledge stored within large language models arises from two distinct sources: intrinsic errors inherent in the model which should be corrected and outdated knowledge due to external shifts…

Computation and Language · Computer Science 2023-12-15 Xunjian Yin, Jin Jiang, Liming Yang, Xiaojun Wan

Recent advances in visual generative models have enabled high-fidelity image editing guided by human instructions. However, these models often struggle with complex instructions involving combinatorial editing operations or inter-step…

Computer Vision and Pattern Recognition · Computer Science 2026-05-12 Zilai Zeng, Mingdeng Cao, Zijie Li, Xiaochen Lian, Yichun Shi, Peihao Zhu, Chen Sun, Peng Wang

Large Language Models often contain factually incorrect or outdated knowledge, giving rise to model editing methods for precise knowledge updates. However, current mainstream locate-then-edit approaches exhibit a progressive performance…

Computation and Language · Computer Science 2025-10-28 Peng Wang, Biyu Zhou, Xuehai Tang, Jizhong Han, Songlin Hu

Concept Bottleneck Models (CBMs) have garnered much attention for their ability to elucidate the prediction process through a human-understandable concept layer. However, most previous studies focused on static scenarios where the data and…

Machine Learning · Computer Science 2026-01-05 Hongbin Lin, Chenyang Ren, Juangui Xu, Zhengyu Hu, Cheng-Long Wang, Yao Shu, Hui Xiong, Jingfeng Zhang, Di Wang, Lijie Hu

Deep learning models in recommender systems are usually trained in the batch mode, namely iteratively trained on a fixed-size window of training data. Such batch mode training of deep learning models suffers from low training efficiency,…

Information Retrieval · Computer Science 2020-09-07 Yichao Wang, Huifeng Guo, Ruiming Tang, Zhirong Liu, Xiuqiang He

Instruction-based image editing has emerged as a prominent research area, which, benefiting from image generation foundation models, has achieved high aesthetic quality, making instruction-following capability the primary challenge.…

Computer Vision and Pattern Recognition · Computer Science 2025-12-08 Hongyu Li, Manyuan Zhang, Dian Zheng, Ziyu Guo, Yimeng Jia, Kaituo Feng, Hao Yu, Yexin Liu, Yan Feng, Peng Pei, Xunliang Cai, Linjiang Huang, Hongsheng Li, Si Liu

Deep neural networks are susceptible to catastrophic forgetting when trained on sequential tasks. Various continual learning (CL) methods often rely on exemplar buffers and/or network expansion for balancing model stability and plasticity,…

Machine Learning · Computer Science 2024-01-18 Depeng Li, Tianqi Wang, Junwei Chen, Qining Ren, Kenji Kawaguchi, Zhigang Zeng

We propose a framework for training non-autoregressive sequence-to-sequence models for editing tasks, where the original input sequence is iteratively edited to produce the output. We show that the imitation learning algorithms designed to…

Computation and Language · Computer Science 2022-03-18 Sweta Agrawal, Marine Carpuat

As the world changes, we need to be able to update our models and correct false information without costly retraining. Knowledge-based model editing enables precise modifications to the weights of large language models in order to modify…

Machine Learning · Computer Science 2025-02-28 Elan Markowitz, Anil Ramakrishna, Ninareh Mehrabi, Charith Peris, Rahul Gupta, Kai-Wei Chang, Aram Galstyan

Existing GUI agent models relying on coordinate-based one-step visual grounding struggle with generalizing to varying input resolutions and aspect ratios. Alternatives introduce coordinate-free strategies yet suffer from learning under…

Machine Learning · Computer Science 2026-02-04 Xiaoce Wang, Guibin Zhang, Junzhe Li, Jinzhe Tu, Chun Li, Ming Li

Knowledge Editing is a technique that updates large language models (LLMs) with new information to maintain their world knowledge. This approach avoids the need to rebuild the model from scratch, thereby addressing the high costs associated…

Computation and Language · Computer Science 2025-09-09 Changyue Wang, Weihang Su, Qingyao Ai, Yichen Tang, Yiqun Liu

Concept Bottleneck Models (CBMs) map the inputs onto a set of interpretable concepts ("the bottleneck") and use the concepts to make predictions. A concept bottleneck enhances interpretability since it can be investigated to understand…

Machine Learning · Computer Science 2023-02-03 Mert Yuksekgonul, Maggie Wang, James Zou

In many real-world scenarios, data to train machine learning models becomes available over time. Unfortunately, these models struggle to continually learn new concepts without forgetting what has been learnt in the past. This phenomenon is…

Computation and Language · Computer Science 2023-01-16 Beyza Ermis, Giovanni Zappella, Martin Wistuba, Aditya Rawal, Cedric Archambeau

Concept Bottleneck Models (CBMs) have garnered much attention for their ability to elucidate the prediction process through a human-understandable concept layer. However, most previous studies focused on cases where the data, including…

Machine Learning · Computer Science 2025-02-04 Lijie Hu, Chenyang Ren, Zhengyu Hu, Hongbin Lin, Cheng-Long Wang, Hui Xiong, Jingfeng Zhang, Di Wang

Most existing sequence generation models produce outputs in one pass, usually left-to-right. However, this is in contrast with a more natural approach that humans use in generating content: iterative refinement and editing. Recent work has…

Computation and Language · Computer Science 2022-05-26 Machel Reid, Graham Neubig

Multi-step reasoning ability of large language models is crucial in tasks such as math and tool utilization. Current research predominantly focuses on enhancing model performance in these multi-step reasoning tasks through fine-tuning with…

Computation and Language · Computer Science 2024-10-23 Yuli Qiu, Jiashu Yao, Heyan Huang, Yuhang Guo

Recent research has shown that language models have a tendency to memorize rare or unique sequences in the training corpora which can thus leak sensitive attributes of user data. We employ a teacher-student framework and propose a novel…

Computation and Language · Computer Science 2022-10-14 Zhe Liu, Xuedong Zhang, Fuchun Peng

Large language models (LLMs) struggle with hallucinations due to false or outdated knowledge. Given the high resource demands of retraining these models, there is an increasing focus on developing model editing. However, the general…

Computation and Language · Computer Science 2026-04-13 Hao-Xiang Xu, Jun-Yu Ma, Zhen-Hua Ling, Ningyu Zhang, Jia-Chen Gu

Prompt-based continual learning methods fine-tune only a small set of additional learnable parameters while keeping the pre-trained model's parameters frozen. It enables efficient adaptation to new tasks while mitigating the risk of…

Computer Vision and Pattern Recognition · Computer Science 2026-05-11 Shengqin Jiang, Tianqi Kong, Yuankai Qi, Haokui Zhang, Lina Yao, Quan Z. Sheng, Qingshan Liu, Ming-Hsuan Yang

Given an original image, image editing aims to generate an image that aligns with the provided instruction. The challenges are accepting multimodal inputs as instructions and a scarcity of high-quality training data, including crucial…

Computer Vision and Pattern Recognition · Computer Science 2024-12-17 Zhen Han, Chaojie Mao, Zeyinzi Jiang, Yulin Pan, Jingfeng Zhang