KoRe: Compact Knowledge Representations for Large Language Models

Davide Cavicchini; Fausto Giunchiglia; Jacopo Staiano

KoRe: Compact Knowledge Representations for Large Language Models

Computation and Language 2026-05-20 v1

Authors: Davide Cavicchini , Fausto Giunchiglia , Jacopo Staiano

Abstract

Modern Large Language Models (LLMs) have shown impressive performances in user-facing tasks such as question answering, as well as consistent improvements in reasoning capabilities. Still, the way these models encode knowledge seems inherently flawed: by design, LLMs encode world-knowledge within their parameters. This way of representing knowledge is inherently opaque, difficult to debug and update, and prone to hallucinations. On the other hand, Knowledge Graphs can provide human-readable and easily editable world knowledge representations, and their application in knowledge-intensive tasks has consistently proven beneficial to downstream performance. Nonetheless, current integration techniques require extensive retraining or finetuning. To overcome this issue, we introduce KoRe, a methodology to encode 1-hop sub-graphs into compact discrete knowledge tokens and inject them into a LLM backbone. We test the proposed approach on three established benchmarks, and report competitive performances coupled with a significant reduction (up to 10x) in token usage. Our results show that compact discrete KG representations can efficiently and effectively be used to ground modern LLMs.

Keywords

knowledge graph tokenization large language model

Cite

@article{arxiv.2605.20170,
  title  = {KoRe: Compact Knowledge Representations for Large Language Models},
  author = {Davide Cavicchini and Fausto Giunchiglia and Jacopo Staiano},
  journal= {arXiv preprint arXiv:2605.20170},
  year   = {2026}
}

KoRe: Compact Knowledge Representations for Large Language Models

Abstract

Keywords

Cite

Related papers