Related papers: Watermarking Language Models with Error Correcting…

I Know You Did Not Write That! A Sampling Based Watermarking Method for Identifying Machine Generated Text

Potential harms of Large Language Models such as mass misinformation and plagiarism can be partially mitigated if there exists a reliable way to detect machine generated text. In this paper, we propose a new watermarking method to detect…

Computation and Language · Computer Science 2023-12-12 Kaan Efe Keleş , Ömer Kaan Gürbüz , Mucahid Kutlu

Three Bricks to Consolidate Watermarks for Large Language Models

The task of discerning between generated and natural texts is increasingly challenging. In this context, watermarking emerges as a promising technique for ascribing generated text to a specific model. It alters the sampling generation…

Computation and Language · Computer Science 2023-11-09 Pierre Fernandez , Antoine Chaffin , Karim Tit , Vivien Chappelier , Teddy Furon

A Certified Robust Watermark For Large Language Models

The effectiveness of watermark algorithms in AI-generated text identification has garnered significant attention. Concurrently, an increasing number of watermark algorithms have been proposed to enhance the robustness against various…

Cryptography and Security · Computer Science 2024-10-01 Xianheng Feng , Jian Liu , Kui Ren , Chun Chen

Revisiting the Robustness of Watermarking to Paraphrasing Attacks

Amidst rising concerns about the internet being proliferated with content generated from language models (LMs), watermarking is seen as a principled way to certify whether text was generated from a model. Many recent watermarking techniques…

Cryptography and Security · Computer Science 2024-11-11 Saksham Rastogi , Danish Pruthi

Let Watermarks Speak: A Robust and Unforgeable Watermark for Language Models

Watermarking is an effective way to trace model-generated content. Current watermark methods cannot resist forgery attacks, such as a deceptive claim that the model-generated content is a response to a fabricated prompt. None of them can be…

Cryptography and Security · Computer Science 2024-12-30 Minhao Bai

Debiasing Watermarks for Large Language Models via Maximal Coupling

Watermarking language models is essential for distinguishing between human and machine-generated text and thus maintaining the integrity and trustworthiness of digital communication. We present a novel green/red list watermarking approach…

Machine Learning · Statistics 2025-06-13 Yangxinyu Xie , Xiang Li , Tanwi Mallick , Weijie J. Su , Ruixun Zhang

Watermarking Text Generated by Black-Box Language Models

LLMs now exhibit human-like skills in various fields, leading to worries about misuse. Thus, detecting generated text is crucial. However, passive detection methods are stuck in domain specificity and limited adversarial robustness. To…

Computation and Language · Computer Science 2023-05-17 Xi Yang , Kejiang Chen , Weiming Zhang , Chang Liu , Yuang Qi , Jie Zhang , Han Fang , Nenghai Yu

Multi-Bit Distortion-Free Watermarking for Large Language Models

Methods for watermarking large language models have been proposed that distinguish AI-generated text from human-generated text by slightly altering the model output distribution, but they also distort the quality of the text, exposing the…

Computation and Language · Computer Science 2024-02-27 Massieh Kordi Boroujeny , Ya Jiang , Kai Zeng , Brian Mark

XMark: Reliable Multi-Bit Watermarking for LLM-Generated Texts

Multi-bit watermarking has emerged as a promising solution for embedding imperceptible binary messages into Large Language Model (LLM)-generated text, enabling reliable attribution and tracing of malicious usage of LLMs. Despite recent…

Computation and Language · Computer Science 2026-04-17 Jiahao Xu , Rui Hu , Olivera Kotevska , Zikai Zhang

Provably Robust Multi-bit Watermarking for AI-generated Text

Large Language Models (LLMs) have demonstrated remarkable capabilities of generating texts resembling human language. However, they can be misused by criminals to create deceptive content, such as fake news and phishing emails, which raises…

Cryptography and Security · Computer Science 2025-01-29 Wenjie Qu , Wengrui Zheng , Tianyang Tao , Dong Yin , Yanze Jiang , Zhihua Tian , Wei Zou , Jinyuan Jia , Jiaheng Zhang

Improved Unbiased Watermark for Large Language Models

As artificial intelligence surpasses human capabilities in text generation, the necessity to authenticate the origins of AI-generated content has become paramount. Unbiased watermarks offer a powerful solution by embedding statistical…

Computation and Language · Computer Science 2025-08-07 Ruibo Chen , Yihan Wu , Junfeng Guo , Heng Huang

Signal Watermark on Large Language Models

As Large Language Models (LLMs) become increasingly sophisticated, they raise significant security concerns, including the creation of fake news and academic misuse. Most detectors for identifying model-generated text are limited by their…

Cryptography and Security · Computer Science 2024-10-10 Zhenyu Xu , Victor S. Sheng

Watermarking Autoregressive Image Generation

Watermarking the outputs of generative models has emerged as a promising approach for tracking their provenance. Despite significant interest in autoregressive image generation models and their potential for misuse, no prior work has…

Machine Learning · Computer Science 2025-10-24 Nikola Jovanović , Ismail Labiad , Tomáš Souček , Martin Vechev , Pierre Fernandez

Unforgeable Watermarks for Language Models via Robust Signatures

Language models now routinely produce text that is difficult to distinguish from human writing, raising the need for robust tools to verify content provenance. Watermarking has emerged as a promising countermeasure, with existing work…

Cryptography and Security · Computer Science 2026-02-18 Huijia Lin , Kameron Shahabi , Min Jae Song

On the Reliability of Watermarks for Large Language Models

As LLMs become commonplace, machine-generated text has the potential to flood the internet with spam, social media bots, and valueless content. Watermarking is a simple and effective strategy for mitigating such harms by enabling the…

Machine Learning · Computer Science 2024-05-03 John Kirchenbauer , Jonas Geiping , Yuxin Wen , Manli Shu , Khalid Saifullah , Kezhi Kong , Kasun Fernando , Aniruddha Saha , Micah Goldblum , Tom Goldstein

Provable Robust Watermarking for AI-Generated Text

We study the problem of watermarking large language models (LLMs) generated text -- one of the most promising approaches for addressing the safety challenges of LLM usage. In this paper, we propose a rigorous theoretical framework to…

Computation and Language · Computer Science 2023-10-16 Xuandong Zhao , Prabhanjan Ananth , Lei Li , Yu-Xiang Wang

A Watermark for Large Language Models

Potential harms of large language models can be mitigated by watermarking model output, i.e., embedding signals into generated text that are invisible to humans but algorithmically detectable from a short span of tokens. We propose a…

Machine Learning · Computer Science 2024-05-03 John Kirchenbauer , Jonas Geiping , Yuxin Wen , Jonathan Katz , Ian Miers , Tom Goldstein

Is The Watermarking Of LLM-Generated Code Robust?

We present the first in depth study on the robustness of existing watermarking techniques applied to code generated by large language models (LLMs). As LLMs increasingly contribute to software development, watermarking has emerged as a…

Cryptography and Security · Computer Science 2025-08-21 Tarun Suresh , Shubham Ugare , Gagandeep Singh , Sasa Misailovic

Segmenting Watermarked Texts From Language Models

Watermarking is a technique that involves embedding nearly unnoticeable statistical signals within generated content to help trace its source. This work focuses on a scenario where an untrusted third-party user sends prompts to a trusted…

Machine Learning · Computer Science 2024-10-29 Xingchi Li , Guanxun Li , Xianyang Zhang

Mark My Words: Analyzing and Evaluating Language Model Watermarks

The capabilities of large language models have grown significantly in recent years and so too have concerns about their misuse. It is important to be able to distinguish machine-generated text from human-authored content. Prior works have…

Cryptography and Security · Computer Science 2024-10-15 Julien Piet , Chawin Sitawarin , Vivian Fang , Norman Mu , David Wagner