Marking Code Without Breaking It: Code Watermarking for Detecting LLM-Generated Code

Jungin Kim; Shinwoo Park; Yo-Sub Han

Cryptography and Security; Artificial Intelligence (2026-02-10, v4)

Abstract

Identifying LLM-generated code through watermarking poses a challenge in preserving functional correctness. Previous methods rely on the assumption that watermarking high-entropy tokens effectively maintains output quality. Our analysis reveals a fundamental limitation of this assumption: syntax-critical tokens such as keywords often exhibit the highest entropy, making existing approaches vulnerable to logic corruption. We present STONE, a syntax-aware watermarking method that embeds watermarks only in non-syntactic tokens and preserves code integrity. For rigorous evaluation, we also introduce STEM, a comprehensive metric that balances three critical dimensions: correctness, detectability, and imperceptibility. Across Python, C++, and Java, STONE preserves correctness, sustains strong detectability, and achieves balanced performance with minimal computational overhead. Our implementation is available at https://github.com/inistory/STONE-watermarking.
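The syntax-aware idea in the abstract can be illustrated with a small detection-side sketch. This is not the authors' implementation (that lives in the linked repository). It is a toy that assumes a hash-based green list keyed on the previous token, a green fraction `GAMMA` of 0.5, and Python lexer tokens standing in for LLM vocabulary tokens. The names `non_syntactic_tokens`, `is_green`, and `z_score` are hypothetical, and treating only identifiers and literals as non-syntactic is a simplifying assumption.

```python
import hashlib
import io
import keyword
import math
import tokenize

GAMMA = 0.5  # assumed fraction of candidates on the "green" list


def non_syntactic_tokens(source: str) -> list[str]:
    """Keep identifiers and literals; skip keywords, operators, and layout tokens."""
    kept = []
    for tok in tokenize.generate_tokens(io.StringIO(source).readline):
        if tok.type == tokenize.NAME and not keyword.iskeyword(tok.string):
            kept.append(tok.string)
        elif tok.type in (tokenize.NUMBER, tokenize.STRING):
            kept.append(tok.string)
    return kept


def is_green(prev: str, cur: str, key: str) -> bool:
    """Assumed green-list rule: a keyed hash of (previous, current) token."""
    digest = hashlib.sha256(f"{key}|{prev}|{cur}".encode()).digest()
    return digest[0] / 256 < GAMMA


def z_score(source: str, key: str) -> float:
    """Score only non-syntactic tokens, so keywords never carry the watermark."""
    toks = non_syntactic_tokens(source)
    n = len(toks) - 1  # the first token has no predecessor to seed the hash
    if n < 1:
        return 0.0
    green = sum(is_green(toks[i - 1], toks[i], key) for i in range(1, len(toks)))
    return (green - GAMMA * n) / math.sqrt(n * GAMMA * (1 - GAMMA))


if __name__ == "__main__":
    snippet = "def add(a, b):\n    return a + b\n"
    print(non_syntactic_tokens(snippet))
    print(z_score(snippet, key="demo-key"))
```

In this sketch, a large positive z-score would suggest that green tokens are over-represented among identifiers and literals. Because keywords, operators, and delimiters are excluded from both embedding and scoring, the tokens that determine program structure are never biased by the watermark.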

Keywords

model watermarking, benchmark evaluation, code generation

Cite

@article{arxiv.2502.18851,
  title  = {Marking Code Without Breaking It: Code Watermarking for Detecting LLM-Generated Code},
  author = {Jungin Kim and Shinwoo Park and Yo-Sub Han},
  journal= {arXiv preprint arXiv:2502.18851},
  year   = {2026}
}

Comments

Findings of EACL 2026
