English
Related papers

Related papers: Tuning Language Models by Proxy

200 papers

Black-box tuning has attracted recent attention due to that the structure or inner parameters of advanced proprietary models are not accessible. Proxy-tuning provides a test-time output adjustment for tuning black-box language models. It…

Machine Learning · Computer Science 2024-07-02 Yuanyang He , Zitong Huang , Xinxing Xu , Rick Siow Mong Goh , Salman Khan , Wangmeng Zuo , Yong Liu , Chun-Mei Feng

Black-box tuning is an emerging paradigm for adapting large language models (LLMs) to better achieve desired behaviors, particularly when direct access to model parameters is unavailable. Current strategies, however, often present a dilemma…

Artificial Intelligence · Computer Science 2025-12-17 Zhikang Xie , Weilin Wan , Peizhu Gong , Weizhong Zhang , Cheng Jin

Performance prediction is a method to estimate the performance of Language Models (LMs) on various Natural Language Processing (NLP) tasks, mitigating computational costs associated with model capacity and data for fine-tuning. Our paper…

Computation and Language · Computer Science 2024-12-17 David Anugraha , Genta Indra Winata , Chenyue Li , Patrick Amadeus Irawan , En-Shiun Annie Lee

Methods for adapting language models (LMs) to new tasks and domains have traditionally assumed white-box access to the model, and work by modifying its parameters. However, this is incompatible with a recent trend in the field, where the…

Computation and Language · Computer Science 2023-05-29 Aitor Ormazabal , Mikel Artetxe , Eneko Agirre

In recent years, Large Language Models (LLMs) have demonstrated remarkable abilities in various natural language processing tasks. However, adapting these models to specialized domains using private datasets stored on resource-constrained…

Cryptography and Security · Computer Science 2025-03-20 Ziyao Wang , Yexiao He , Zheyu Shen , Yu Li , Guoheng Sun , Myungjin Lee , Ang Li

Deploying language models (LMs) in customer-facing speech applications requires conversational fluency and adherence to specific stylistic guidelines. This can be challenging to achieve reliably using complex system prompts due to issues…

Machine Learning · Computer Science 2025-07-08 Ingo Marquardt , Philippe Brule

It has been shown that Large Language Model (LLM) alignments can be circumvented by appending specially crafted attack suffixes with harmful queries to elicit harmful responses. To conduct attacks against private target models whose…

Prompting large language models has gained immense popularity in recent years due to the advantage of producing good results even without the need for labelled data. However, this requires prompt tuning to get optimal prompts that lead to…

Computation and Language · Computer Science 2024-03-06 Jacob-Junqi Tian , David Emerson , Sevil Zanjani Miyandoab , Deval Pandya , Laleh Seyyed-Kalantari , Faiza Khan Khattak

Fine-tuning enables large language models (LLMs) to adapt to specific domains, but often compromises their previously established safety alignment. To mitigate the degradation of model safety during fine-tuning, we introduce LookAhead…

Computation and Language · Computer Science 2025-12-22 Kangwei Liu , Mengru Wang , Yujie Luo , Lin Yuan , Mengshu Sun , Lei Liang , Zhiqiang Zhang , Jun Zhou , Bryan Hooi , Shumin Deng

Recently, prompt tuning \cite{lester2021power} has gradually become a new paradigm for NLP, which only depends on the representation of the words by freezing the parameters of pre-trained language models (PLMs) to obtain remarkable…

Computation and Language · Computer Science 2022-01-31 Pan He , Yuxi Chen , Yan Wang , Yanru Zhang

Several previous works concluded that the largest part of generation capabilities of large language models (LLM) are learned (early) during pre-training. However, LLMs still require further alignment to adhere to downstream task…

Computation and Language · Computer Science 2026-01-27 Ayoub Hammal , Pierre Zweigenbaum , Caio Corro

Large Language Model (LLM) alignment conventionally relies on supervised fine-tuning or reinforcement learning based alignment frameworks. These methods typically require labeled or preference datasets and involve updating model weights to…

Computation and Language · Computer Science 2025-03-21 Reem I. Masoud , Martin Ferianc , Philip Treleaven , Miguel Rodrigues

It is often desirable for Large Language Models (LLMs) to capture multiple objectives when providing a response. In document-grounded response generation, for example, agent responses are expected to be relevant to a user's query while also…

Computation and Language · Computer Science 2024-03-05 Keshav Ramji , Young-Suk Lee , Ramón Fernandez Astudillo , Md Arafat Sultan , Tahira Naseem , Asim Munawar , Radu Florian , Salim Roukos

The advent of large language models (LLMs) has revolutionized the field of text generation, producing outputs that closely mimic human-like writing. Although academic and industrial institutions have developed detectors to prevent the…

Machine Learning · Computer Science 2025-02-25 Tianchun Wang , Yuanzhou Chen , Zichuan Liu , Zhanwen Chen , Haifeng Chen , Xiang Zhang , Wei Cheng

Modern language models are trained almost exclusively on token sequences produced by a fixed tokenizer, an external lossless compressor often over UTF-8 byte sequences, thereby coupling the model to that compressor. This work introduces…

Computation and Language · Computer Science 2026-05-15 Lin Zheng , Xinyu Li , Qian Liu , Xiachong Feng , Lingpeng Kong

Post-hoc explanations provide transparency and are essential for guiding model optimization, such as prompt engineering and data sanitation. However, applying model-agnostic techniques to Large Language Models (LLMs) is hindered by…

Machine Learning · Computer Science 2026-04-13 Junhao Liu , Haonan Yu , Zhenyu Yan , Xin Zhang

Open-sourced large language models (LLMs) have demonstrated remarkable efficacy in various tasks with instruction tuning. However, these models can sometimes struggle with tasks that require more specialized knowledge such as translation.…

Computation and Language · Computer Science 2024-01-23 Jiali Zeng , Fandong Meng , Yongjing Yin , Jie Zhou

Learning from human feedback via proxy reward modeling has been studied to align Large Language Models (LLMs) with human values. However, achieving reliable training through that proxy reward model (RM) is not a trivial problem, and its…

Machine Learning · Computer Science 2024-10-08 Sungdong Kim , Minjoon Seo

Adapting large language models (LLMs) to low-resource languages (LRLs) is constrained by the scarcity of task data and computational resources. Although Proxy Tuning offers a logit-level strategy for introducing scaling effects, it often…

Computation and Language · Computer Science 2026-04-21 Chen Zhang , Jiuheng Lin , Zhiyuan Liao , Yansong Feng

Large Language Models (LLMs) have surged in popularity in recent months, but they have demonstrated concerning capabilities to generate harmful content when manipulated. While techniques like safety fine-tuning aim to minimize harmful use,…

Computation and Language · Computer Science 2024-02-16 Chawin Sitawarin , Norman Mu , David Wagner , Alexandre Araujo
‹ Prev 1 2 3 10 Next ›