An Online Reference-Free Evaluation Framework for Flowchart Image-to-Code Generation

Giang Son Nguyen; Zi Pong Lim; Sarthak Ketanbhai Modi; Yon Shin Teo; Wenya Wang

An Online Reference-Free Evaluation Framework for Flowchart Image-to-Code Generation

Computer Vision and Pattern Recognition 2026-02-17 v1 Artificial Intelligence Computation and Language

Authors: Giang Son Nguyen , Zi Pong Lim , Sarthak Ketanbhai Modi , Yon Shin Teo , Wenya Wang

Abstract

Vision-Language Models (VLMs) are increasingly used in document processing pipelines to convert flowchart images into structured code (e.g., Mermaid). In production, these systems process arbitrary inputs for which no ground-truth code exists, making output quality difficult to assess. We propose a reference-free evaluation framework that monitors flowchart image-to-code generation quality at inference time, using only the input image and the generated output. The framework introduces two automated metrics: $\text{Recall}{\text{OCR}}$ , which estimates content coverage by extracting text from the input image via OCR as a proxy reference, and $\text{Precision}{\text{VE}}$ , which detects hallucinated elements through Visual Entailment against the original image. Their harmonic mean, $\text{F1}{\text{OCR-VE}}$ , provides a unified quality score. Validation on the FlowVQA dataset shows strong agreement with ground-truth metrics (average Pearson's $r = 0.97$ , $0.91$ , and $0.94$ for Recall, Precision, and F1, respectively), confirming the framework's reliability as a practical, reference-free alternative for continuous quality monitoring in production settings.

Keywords

benchmark evaluation code generation program analysis

Cite

@article{arxiv.2602.13376,
  title  = {An Online Reference-Free Evaluation Framework for Flowchart Image-to-Code Generation},
  author = {Giang Son Nguyen and Zi Pong Lim and Sarthak Ketanbhai Modi and Yon Shin Teo and Wenya Wang},
  journal= {arXiv preprint arXiv:2602.13376},
  year   = {2026}
}

Comments

9 pages, 4 tables. Under review

An Online Reference-Free Evaluation Framework for Flowchart Image-to-Code Generation

Abstract

Keywords

Cite

Comments

Related papers