English
Related papers

Related papers: Watermarking Language Models with Error Correcting…

200 papers

Potential harms of Large Language Models such as mass misinformation and plagiarism can be partially mitigated if there exists a reliable way to detect machine generated text. In this paper, we propose a new watermarking method to detect…

Computation and Language · Computer Science 2023-12-12 Kaan Efe Keleş , Ömer Kaan Gürbüz , Mucahid Kutlu

The task of discerning between generated and natural texts is increasingly challenging. In this context, watermarking emerges as a promising technique for ascribing generated text to a specific model. It alters the sampling generation…

Computation and Language · Computer Science 2023-11-09 Pierre Fernandez , Antoine Chaffin , Karim Tit , Vivien Chappelier , Teddy Furon

The effectiveness of watermark algorithms in AI-generated text identification has garnered significant attention. Concurrently, an increasing number of watermark algorithms have been proposed to enhance the robustness against various…

Cryptography and Security · Computer Science 2024-10-01 Xianheng Feng , Jian Liu , Kui Ren , Chun Chen

Amidst rising concerns about the internet being proliferated with content generated from language models (LMs), watermarking is seen as a principled way to certify whether text was generated from a model. Many recent watermarking techniques…

Cryptography and Security · Computer Science 2024-11-11 Saksham Rastogi , Danish Pruthi

Watermarking is an effective way to trace model-generated content. Current watermark methods cannot resist forgery attacks, such as a deceptive claim that the model-generated content is a response to a fabricated prompt. None of them can be…

Cryptography and Security · Computer Science 2024-12-30 Minhao Bai

Watermarking language models is essential for distinguishing between human and machine-generated text and thus maintaining the integrity and trustworthiness of digital communication. We present a novel green/red list watermarking approach…

Machine Learning · Statistics 2025-06-13 Yangxinyu Xie , Xiang Li , Tanwi Mallick , Weijie J. Su , Ruixun Zhang

LLMs now exhibit human-like skills in various fields, leading to worries about misuse. Thus, detecting generated text is crucial. However, passive detection methods are stuck in domain specificity and limited adversarial robustness. To…

Computation and Language · Computer Science 2023-05-17 Xi Yang , Kejiang Chen , Weiming Zhang , Chang Liu , Yuang Qi , Jie Zhang , Han Fang , Nenghai Yu

Methods for watermarking large language models have been proposed that distinguish AI-generated text from human-generated text by slightly altering the model output distribution, but they also distort the quality of the text, exposing the…

Computation and Language · Computer Science 2024-02-27 Massieh Kordi Boroujeny , Ya Jiang , Kai Zeng , Brian Mark

Multi-bit watermarking has emerged as a promising solution for embedding imperceptible binary messages into Large Language Model (LLM)-generated text, enabling reliable attribution and tracing of malicious usage of LLMs. Despite recent…

Computation and Language · Computer Science 2026-04-17 Jiahao Xu , Rui Hu , Olivera Kotevska , Zikai Zhang

Large Language Models (LLMs) have demonstrated remarkable capabilities of generating texts resembling human language. However, they can be misused by criminals to create deceptive content, such as fake news and phishing emails, which raises…

Cryptography and Security · Computer Science 2025-01-29 Wenjie Qu , Wengrui Zheng , Tianyang Tao , Dong Yin , Yanze Jiang , Zhihua Tian , Wei Zou , Jinyuan Jia , Jiaheng Zhang

As artificial intelligence surpasses human capabilities in text generation, the necessity to authenticate the origins of AI-generated content has become paramount. Unbiased watermarks offer a powerful solution by embedding statistical…

Computation and Language · Computer Science 2025-08-07 Ruibo Chen , Yihan Wu , Junfeng Guo , Heng Huang

As Large Language Models (LLMs) become increasingly sophisticated, they raise significant security concerns, including the creation of fake news and academic misuse. Most detectors for identifying model-generated text are limited by their…

Cryptography and Security · Computer Science 2024-10-10 Zhenyu Xu , Victor S. Sheng

Watermarking the outputs of generative models has emerged as a promising approach for tracking their provenance. Despite significant interest in autoregressive image generation models and their potential for misuse, no prior work has…

Machine Learning · Computer Science 2025-10-24 Nikola Jovanović , Ismail Labiad , Tomáš Souček , Martin Vechev , Pierre Fernandez

Language models now routinely produce text that is difficult to distinguish from human writing, raising the need for robust tools to verify content provenance. Watermarking has emerged as a promising countermeasure, with existing work…

Cryptography and Security · Computer Science 2026-02-18 Huijia Lin , Kameron Shahabi , Min Jae Song

As LLMs become commonplace, machine-generated text has the potential to flood the internet with spam, social media bots, and valueless content. Watermarking is a simple and effective strategy for mitigating such harms by enabling the…

We study the problem of watermarking large language models (LLMs) generated text -- one of the most promising approaches for addressing the safety challenges of LLM usage. In this paper, we propose a rigorous theoretical framework to…

Computation and Language · Computer Science 2023-10-16 Xuandong Zhao , Prabhanjan Ananth , Lei Li , Yu-Xiang Wang

Potential harms of large language models can be mitigated by watermarking model output, i.e., embedding signals into generated text that are invisible to humans but algorithmically detectable from a short span of tokens. We propose a…

Machine Learning · Computer Science 2024-05-03 John Kirchenbauer , Jonas Geiping , Yuxin Wen , Jonathan Katz , Ian Miers , Tom Goldstein

We present the first in depth study on the robustness of existing watermarking techniques applied to code generated by large language models (LLMs). As LLMs increasingly contribute to software development, watermarking has emerged as a…

Cryptography and Security · Computer Science 2025-08-21 Tarun Suresh , Shubham Ugare , Gagandeep Singh , Sasa Misailovic

Watermarking is a technique that involves embedding nearly unnoticeable statistical signals within generated content to help trace its source. This work focuses on a scenario where an untrusted third-party user sends prompts to a trusted…

Machine Learning · Computer Science 2024-10-29 Xingchi Li , Guanxun Li , Xianyang Zhang

The capabilities of large language models have grown significantly in recent years and so too have concerns about their misuse. It is important to be able to distinguish machine-generated text from human-authored content. Prior works have…

Cryptography and Security · Computer Science 2024-10-15 Julien Piet , Chawin Sitawarin , Vivian Fang , Norman Mu , David Wagner
‹ Prev 1 2 3 10 Next ›