English

Knowledge Return Oriented Prompting (KROP)

Cryptography and Security 2024-06-19 v1 Machine Learning

Abstract

Many Large Language Models (LLMs) and LLM-powered apps deployed today use some form of prompt filter or alignment to protect their integrity. However, these measures aren't foolproof. This paper introduces KROP, a prompt injection technique capable of obfuscating prompt injection attacks, rendering them virtually undetectable to most of these security measures.

Keywords

Cite

@article{arxiv.2406.11880,
  title  = {Knowledge Return Oriented Prompting (KROP)},
  author = {Jason Martin and Kenneth Yeung},
  journal= {arXiv preprint arXiv:2406.11880},
  year   = {2024}
}
R2 v1 2026-06-28T17:09:10.913Z