Related papers: Watermarking Generative Tabular Data

Adaptive and Robust Watermark for Generative Tabular Data

In recent years, watermarking generative tabular data has become a prominent framework to protect against the misuse of synthetic data. However, while most prior work in watermarking methods for tabular data demonstrate a wide variety of…

Cryptography and Security · Computer Science 2025-11-17 Dung Daniel Ngo , Archan Ray , Akshay Seshadri , Daniel Scott , Saheed Obitayo , Niraj Kumar , Vamsi K. Potluru , Marco Pistoia , Manuela Veloso

TabularMark: Watermarking Tabular Datasets for Machine Learning

Watermarking is broadly utilized to protect ownership of shared data while preserving data utility. However, existing watermarking methods for tabular datasets fall short on the desired properties (detectability, non-intrusiveness, and…

Cryptography and Security · Computer Science 2024-06-24 Yihao Zheng , Haocheng Xia , Junyuan Pang , Jinfei Liu , Kui Ren , Lingyang Chu , Yang Cao , Li Xiong

Watermarking Generative Categorical Data

In this paper, we propose a novel statistical framework for watermarking generative categorical data. Our method systematically embeds pre-agreed secret signals by splitting the data distribution into two components and modifying one…

Cryptography and Security · Computer Science 2024-11-19 Bochao Gu , Hengzhi He , Guang Cheng

TableMark: A Multi-bit Watermark for Synthetic Tabular Data

Watermarking has emerged as an effective solution for copyright protection of synthetic data. However, applying watermarking techniques to synthetic tabular data presents challenges, as tabular data can easily lose their watermarks through…

Cryptography and Security · Computer Science 2026-03-17 Yuyang Xia , Yaoqiang Xu , Chen Qian , Yang Li , Guoliang Li , Jianhua Feng

Mitigating Watermark Forgery in Generative Models via Randomized Key Selection

Watermarking enables GenAI providers to verify whether content was generated by their models. A watermark is a hidden signal in the content, whose presence can be detected using a secret watermark key. A core security threat are forgery…

Cryptography and Security · Computer Science 2026-05-12 Toluwani Aremu , Noor Hussein , Munachiso Nwadike , Samuele Poppi , Jie Zhang , Karthik Nandakumar , Neil Gong , Nils Lukas

Watermarks Attack Watermarks: Re-Watermarking as a Generic Removal Strategy

Watermarking combines an imperceptible change to an input image that will trigger a detector, to assert provenance and protect intellectual property. The literature has shown great interest in attacks on watermarking schemes: attackers are…

Cryptography and Security · Computer Science 2026-05-19 Maria Bulychev , Neil G. Marchant , Benjamin I. P. Rubinstein

SEAL: Semantic Aware Image Watermarking

Generative models have rapidly evolved to generate realistic outputs. However, their synthetic outputs increasingly challenge the clear distinction between natural and AI-generated content, necessitating robust watermarking techniques.…

Machine Learning · Computer Science 2026-05-20 Kasra Arabi , R. Teal Witter , Chinmay Hegde , Niv Cohen

Optimal Watermark Generation under Type I and Type II Errors

Watermarking has recently emerged as a crucial tool for protecting the intellectual property of generative models and for distinguishing AI-generated content from human-generated data. Despite its practical success, most existing…

Methodology · Statistics 2025-12-08 Hengzhi He , Shirong Xu , Alexander Nemecek , Jiping Li , Erman Ayday , Guang Cheng

Image Watermarking of Generative Diffusion Models

Embedding watermarks into the output of generative models is essential for establishing copyright and verifiable ownership over the generated content. Emerging diffusion model watermarking methods either embed watermarks in the frequency…

Image and Video Processing · Electrical Eng. & Systems 2025-02-18 Yunzhuo Chen , Jordan Vice , Naveed Akhtar , Nur Al Hasan Haldar , Ajmal Mian

Watermarks in the Sand: Impossibility of Strong Watermarking for Generative Models

Watermarking generative models consists of planting a statistical signal (watermark) in a model's output so that it can be later verified that the output was generated by the given model. A strong watermarking scheme satisfies the property…

Machine Learning · Computer Science 2025-05-29 Hanlin Zhang , Benjamin L. Edelman , Danilo Francati , Daniele Venturi , Giuseppe Ateniese , Boaz Barak

Watermarking Language Models with Error Correcting Codes

Recent progress in large language models enables the creation of realistic machine-generated content. Watermarking is a promising approach to distinguish machine-generated text from human text, embedding statistical signals in the output…

Cryptography and Security · Computer Science 2026-02-25 Patrick Chao , Yan Sun , Edgar Dobriban , Hamed Hassani

Bayesian Watermark Attacks

This paper presents an application of statistical machine learning to the field of watermarking. We propose a new attack model on additive spread-spectrum watermarking systems. The proposed attack is based on Bayesian statistics. We…

Cryptography and Security · Computer Science 2012-06-22 Ivo Shterev , David Dunson

Spread them Apart: Towards Robust Watermarking of Generated Content

Generative models that can produce realistic images have improved significantly in recent years. The quality of the generated content has increased drastically, so sometimes it is very difficult to distinguish between the real images and…

Computer Vision and Pattern Recognition · Computer Science 2026-03-02 Mikhail Pautov , Danil Ivanov , Andrey V. Galichin , Oleg Rogov , Ivan Oseledets

A Resilient and Accessible Distribution-Preserving Watermark for Large Language Models

Watermarking techniques offer a promising way to identify machine-generated content via embedding covert information into the contents generated from language models. A challenge in the domain lies in preserving the distribution of original…

Cryptography and Security · Computer Science 2024-06-26 Yihan Wu , Zhengmian Hu , Junfeng Guo , Hongyang Zhang , Heng Huang

Watermarking Digital Images Based on a Content Based Image Retrieval Technique

The current work is focusing on the implementation of a robust watermarking algorithm for digital images, which is based on an innovative spread spectrum analysis algorithm for watermark embedding and on a content-based image retrieval…

Data Structures and Algorithms · Computer Science 2009-09-29 Dimitrios K. Tsolis , Spyros Sioutas , Theodore S. Papatheodorou

A new Watermarking Technique for Secure Database

Digital multimedia watermarking technology was suggested in the last decade to embed copyright information in digital objects such images, audio and video. However, the increasing use of relational database systems in many real-life…

Databases · Computer Science 2013-04-29 Jun Ziang Pinn , A. Fr. Zung

A Framework to Allow a Third Party to Watermark Numerical Data in an Encrypted Domain while Preserving its Statistical Properties

Watermarking data for source tracking applications by its owner can be unfair for recipients because the data owner may redistribute the same watermarked data to many users. Hence, each data recipient should know the watermark embedded in…

Cryptography and Security · Computer Science 2023-02-06 Mesfer Mohammed Alqarni

Latent Watermark: Inject and Detect Watermarks in Latent Diffusion Space

Watermarking is a tool for actively identifying and attributing the images generated by latent diffusion models. Existing methods face the dilemma of image quality and watermark robustness. Watermarks with superior image quality usually…

Computer Vision and Pattern Recognition · Computer Science 2024-09-27 Zheling Meng , Bo Peng , Jing Dong

WaterSearch: A Quality-Aware Search-based Watermarking Framework for Large Language Models

Watermarking acts as a critical safeguard in text generated by Large Language Models (LLMs). By embedding identifiable signals into model outputs, watermarking enables reliable attribution and enhances the security of machine-generated…

Computation and Language · Computer Science 2026-05-29 Yukang Lin , Jiahao Shao , Shuoran Jiang , Wentao Zhu , Bingjie Lu , Xiangping Wu , Joanna Siebert , Qingcai Chen

Did You Train on My Dataset? Towards Public Dataset Protection with Clean-Label Backdoor Watermarking

The huge supporting training data on the Internet has been a key factor in the success of deep learning models. However, this abundance of public-available data also raises concerns about the unauthorized exploitation of datasets for…

Cryptography and Security · Computer Science 2023-04-11 Ruixiang Tang , Qizhang Feng , Ninghao Liu , Fan Yang , Xia Hu