English

Optimizing Quantum Data Embeddings for Ligand-Based Virtual Screening

Quantum Physics 2025-12-19 v1

Abstract

Effective molecular representations are essential for ligand-based virtual screening. We investigate how quantum data embedding strategies can improve this task by developing and evaluating a family of quantum-classical hybrid embedding approaches. These approaches combine classical neural networks with parameterized quantum circuits in different ways to generate expressive molecular representations and are assessed across two benchmark datasets of different sizes: the LIT-PCBA and COVID-19 collections. Across multiple biological targets and class-imbalance settings, several quantum and hybrid embedding variants consistently outperform classical baselines, especially in limited-data regimes. These results highlight the potential of optimized quantum data embeddings as data-efficient tools for ligand-based virtual screening.

Keywords

Cite

@article{arxiv.2512.16177,
  title  = {Optimizing Quantum Data Embeddings for Ligand-Based Virtual Screening},
  author = {Junggu Choi and Tak Hur and Seokhoon Jeong and Kyle L. Jung and Jun Bae Park and Junho Lee and Jae U. Jung and Daniel K. Park},
  journal= {arXiv preprint arXiv:2512.16177},
  year   = {2025}
}

Comments

16 pages, 5 figures