Large Scale Multi-Task Bayesian Optimization with Large Language Models

Yimeng Zeng; Natalie Maus; Haydn Thomas Jones; Jeffrey Tao; Fangping Wan; Marcelo Der Torossian Torres; Cesar de la Fuente-Nunez; Ryan Marcus; Osbert Bastani; Jacob R. Gardner

Large Scale Multi-Task Bayesian Optimization with Large Language Models

Machine Learning 2025-06-13 v2

Authors: Yimeng Zeng , Natalie Maus , Haydn Thomas Jones , Jeffrey Tao , Fangping Wan , Marcelo Der Torossian Torres , Cesar de la Fuente-Nunez , Ryan Marcus , Osbert Bastani , Jacob R. Gardner

View on arXiv ↗ PDF ↗

Abstract

In multi-task Bayesian optimization, the goal is to leverage experience from optimizing existing tasks to improve the efficiency of optimizing new ones. While approaches using multi-task Gaussian processes or deep kernel transfer exist, the performance improvement is marginal when scaling beyond a moderate number of tasks. We introduce a novel approach leveraging large language models (LLMs) to learn from, and improve upon, previous optimization trajectories, scaling to approximately 1500 distinct tasks. Specifically, we propose a feedback loop in which an LLM is fine-tuned on the high quality solutions to specific tasks found by Bayesian optimization (BO). This LLM is then used to generate initialization points for future BO searches for new tasks. The trajectories of these new searches provide additional training data for fine-tuning the LLM, completing the loop. We evaluate our method on two distinct domains: database query optimization and antimicrobial peptide design. Results demonstrate that our approach creates a positive feedback loop, where the LLM's generated initializations gradually improve, leading to better optimization performance. As this feedback loop continues, we find that the LLM is eventually able to generate solutions to new tasks in just a few shots that are better than the solutions produced by "from scratch" by Bayesian optimization while simultaneously requiring significantly fewer oracle calls.

Keywords

evolutionary optimization large language model bayesian optimization

Cite

@article{arxiv.2503.08131,
  title  = {Large Scale Multi-Task Bayesian Optimization with Large Language Models},
  author = {Yimeng Zeng and Natalie Maus and Haydn Thomas Jones and Jeffrey Tao and Fangping Wan and Marcelo Der Torossian Torres and Cesar de la Fuente-Nunez and Ryan Marcus and Osbert Bastani and Jacob R. Gardner},
  journal= {arXiv preprint arXiv:2503.08131},
  year   = {2025}
}

Large Scale Multi-Task Bayesian Optimization with Large Language Models

Abstract

Keywords

Cite

Related papers