English

Sample Efficient Generative Molecular Optimization with Joint Self-Improvement

Machine Learning 2026-02-12 v1

Abstract

Generative molecular optimization aims to design molecules with properties surpassing those of existing compounds. However, such candidates are rare and expensive to evaluate, yielding sample efficiency essential. Additionally, surrogate models introduced to predict molecule evaluations, suffer from distribution shift as optimization drives candidates increasingly out-of-distribution. To address these challenges, we introduce Joint Self-Improvement, which benefits from (i) a joint generative-predictive model and (ii) a self-improving sampling scheme. The former aligns the generator with the surrogate, alleviating distribution shift, while the latter biases the generative part of the joint model using the predictive one to efficiently generate optimized molecules at inference-time. Experiments across offline and online molecular optimization benchmarks demonstrate that Joint Self-Improvement outperforms state-of-the-art methods under limited evaluation budgets.

Keywords

Cite

@article{arxiv.2602.10984,
  title  = {Sample Efficient Generative Molecular Optimization with Joint Self-Improvement},
  author = {Serra Korkmaz and Adam Izdebski and Jonathan Pirnay and Rasmus Møller-Larsen and Michal Kmicikiewicz and Pankhil Gawade and Dominik G. Grimm and Ewa Szczurek},
  journal= {arXiv preprint arXiv:2602.10984},
  year   = {2026}
}

Comments

14 pages, 5 figures