Related papers: Tuning Language Models by Proxy

CPT: Consistent Proxy Tuning for Black-box Optimization

Black-box tuning has attracted recent attention due to that the structure or inner parameters of advanced proprietary models are not accessible. Proxy-tuning provides a test-time output adjustment for tuning black-box language models. It…

Machine Learning · Computer Science 2024-07-02 Yuanyang He , Zitong Huang , Xinxing Xu , Rick Siow Mong Goh , Salman Khan , Wangmeng Zuo , Yong Liu , Chun-Mei Feng

Advanced Black-Box Tuning of Large Language Models with Limited API Calls

Black-box tuning is an emerging paradigm for adapting large language models (LLMs) to better achieve desired behaviors, particularly when direct access to model parameters is unavailable. Current strategies, however, often present a dilemma…

Artificial Intelligence · Computer Science 2025-12-17 Zhikang Xie , Weilin Wan , Peizhu Gong , Weizhong Zhang , Cheng Jin

ProxyLM: Predicting Language Model Performance on Multilingual Tasks via Proxy Models

Performance prediction is a method to estimate the performance of Language Models (LMs) on various Natural Language Processing (NLP) tasks, mitigating computational costs associated with model capacity and data for fine-tuning. Our paper…

Computation and Language · Computer Science 2024-12-17 David Anugraha , Genta Indra Winata , Chenyue Li , Patrick Amadeus Irawan , En-Shiun Annie Lee

CombLM: Adapting Black-Box Language Models through Small Fine-Tuned Models

Methods for adapting language models (LMs) to new tasks and domains have traditionally assumed white-box access to the model, and work by modifying its parameters. However, this is incompatible with a recent trend in the field, where the…

Computation and Language · Computer Science 2023-05-29 Aitor Ormazabal , Mikel Artetxe , Eneko Agirre

Prada: Black-Box LLM Adaptation with Private Data on Resource-Constrained Devices

In recent years, Large Language Models (LLMs) have demonstrated remarkable abilities in various natural language processing tasks. However, adapting these models to specialized domains using private datasets stored on resource-constrained…

Cryptography and Security · Computer Science 2025-03-20 Ziyao Wang , Yexiao He , Zheyu Shen , Yu Li , Guoheng Sun , Myungjin Lee , Ang Li

Fine-tuning on simulated data outperforms prompting for agent tone of voice

Deploying language models (LMs) in customer-facing speech applications requires conversational fluency and adherence to specific stylistic guidelines. This can be challenging to achieve reliably using complex system prompts due to issues…

Machine Learning · Computer Science 2025-07-08 Ingo Marquardt , Philippe Brule

LoFT: Local Proxy Fine-tuning For Improving Transferability Of Adversarial Attacks Against Large Language Model

It has been shown that Large Language Model (LLM) alignments can be circumvented by appending specially crafted attack suffixes with harmful queries to elicit harmful responses. To conduct attacks against private target models whose…

Computation and Language · Computer Science 2023-10-24 Muhammad Ahmed Shah , Roshan Sharma , Hira Dhamyal , Raphael Olivier , Ankit Shah , Joseph Konan , Dareen Alharthi , Hazim T Bukhari , Massa Baali , Soham Deshmukh , Michael Kuhlmann , Bhiksha Raj , Rita Singh

Soft-prompt Tuning for Large Language Models to Evaluate Bias

Prompting large language models has gained immense popularity in recent years due to the advantage of producing good results even without the need for labelled data. However, this requires prompt tuning to get optimal prompts that lead to…

Computation and Language · Computer Science 2024-03-06 Jacob-Junqi Tian , David Emerson , Sevil Zanjani Miyandoab , Deval Pandya , Laleh Seyyed-Kalantari , Faiza Khan Khattak

LookAhead Tuning: Safer Language Models via Partial Answer Previews

Fine-tuning enables large language models (LLMs) to adapt to specific domains, but often compromises their previously established safety alignment. To mitigate the degradation of model safety during fine-tuning, we introduce LookAhead…

Computation and Language · Computer Science 2025-12-22 Kangwei Liu , Mengru Wang , Yujie Luo , Lin Yuan , Mengshu Sun , Lei Liang , Zhiqiang Zhang , Jun Zhou , Bryan Hooi , Shumin Deng

Protum: A New Method For Prompt Tuning Based on "[MASK]"

Recently, prompt tuning \cite{lester2021power} has gradually become a new paradigm for NLP, which only depends on the representation of the words by freezing the parameters of pre-trained language models (PLMs) to obtain remarkable…

Computation and Language · Computer Science 2022-01-31 Pan He , Yuxi Chen , Yan Wang , Yanru Zhang

Kad: A Framework for Proxy-based Test-time Alignment with Knapsack Approximation Deferral

Several previous works concluded that the largest part of generation capabilities of large language models (LLM) are learned (early) during pre-training. However, LLMs still require further alignment to adhere to downstream task…

Computation and Language · Computer Science 2026-01-27 Ayoub Hammal , Pierre Zweigenbaum , Caio Corro

Cultural Alignment in Large Language Models Using Soft Prompt Tuning

Large Language Model (LLM) alignment conventionally relies on supervised fine-tuning or reinforcement learning based alignment frameworks. These methods typically require labeled or preference datasets and involve updating model weights to…

Computation and Language · Computer Science 2025-03-21 Reem I. Masoud , Martin Ferianc , Philip Treleaven , Miguel Rodrigues

Self-Refinement of Language Models from External Proxy Metrics Feedback

It is often desirable for Large Language Models (LLMs) to capture multiple objectives when providing a response. In document-grounded response generation, for example, agent responses are expected to be relevant to a user's query while also…

Computation and Language · Computer Science 2024-03-05 Keshav Ramji , Young-Suk Lee , Ramón Fernandez Astudillo , Md Arafat Sultan , Tahira Naseem , Asim Munawar , Radu Florian , Salim Roukos

Humanizing the Machine: Proxy Attacks to Mislead LLM Detectors

The advent of large language models (LLMs) has revolutionized the field of text generation, producing outputs that closely mimic human-like writing. Although academic and industrial institutions have developed detectors to prevent the…

Machine Learning · Computer Science 2025-02-25 Tianchun Wang , Yuanzhou Chen , Zichuan Liu , Zhanwen Chen , Haifeng Chen , Xiang Zhang , Wei Cheng

Proxy Compression for Language Modeling

Modern language models are trained almost exclusively on token sequences produced by a fixed tokenizer, an external lossless compressor often over UTF-8 byte sequences, thereby coupling the model to that compressor. This work introduces…

Computation and Language · Computer Science 2026-05-15 Lin Zheng , Xinyu Li , Qian Liu , Xiachong Feng , Lingpeng Kong

Revitalizing Black-Box Interpretability: Actionable Interpretability for LLMs via Proxy Models

Post-hoc explanations provide transparency and are essential for guiding model optimization, such as prompt engineering and data sanitation. However, applying model-agnostic techniques to Large Language Models (LLMs) is hindered by…

Machine Learning · Computer Science 2026-04-13 Junhao Liu , Haonan Yu , Zhenyu Yan , Xin Zhang

TIM: Teaching Large Language Models to Translate with Comparison

Open-sourced large language models (LLMs) have demonstrated remarkable efficacy in various tasks with instruction tuning. However, these models can sometimes struggle with tasks that require more specialized knowledge such as translation.…

Computation and Language · Computer Science 2024-01-23 Jiali Zeng , Fandong Meng , Yongjing Yin , Jie Zhou

Rethinking the Role of Proxy Rewards in Language Model Alignment

Learning from human feedback via proxy reward modeling has been studied to align Large Language Models (LLMs) with human values. However, achieving reliable training through that proxy reward model (RM) is not a trivial problem, and its…

Machine Learning · Computer Science 2024-10-08 Sungdong Kim , Minjoon Seo

Efficient Low-Resource Language Adaptation via Multi-Source Dynamic Logit Fusion

Adapting large language models (LLMs) to low-resource languages (LRLs) is constrained by the scarcity of task data and computational resources. Although Proxy Tuning offers a logit-level strategy for introducing scaling effects, it often…

Computation and Language · Computer Science 2026-04-21 Chen Zhang , Jiuheng Lin , Zhiyuan Liao , Yansong Feng

PAL: Proxy-Guided Black-Box Attack on Large Language Models

Large Language Models (LLMs) have surged in popularity in recent months, but they have demonstrated concerning capabilities to generate harmful content when manipulated. While techniques like safety fine-tuning aim to minimize harmful use,…

Computation and Language · Computer Science 2024-02-16 Chawin Sitawarin , Norman Mu , David Wagner , Alexandre Araujo