Related papers: Bayesian Optimization of Text Representations

Interactive Text Ranking with Bayesian Optimisation: A Case Study on Community QA and Summarisation

For many NLP applications, such as question answering and summarisation, the goal is to select the best solution from a large space of candidates to meet a particular user's needs. To address the lack of user-specific training data, we…

Computation and Language · Computer Science 2020-09-15 Edwin Simpson , Yang Gao , Iryna Gurevych

Bayesian Optimization in Linear Time

Bayesian optimization is a sequential method for minimizing objective functions that are expensive to evaluate and about which few assumptions can be made. By using all gathered data to train a Gaussian process model for the function and…

Machine Learning · Computer Science 2026-05-07 Jesse Schneider , William J. Welch

Comparing Text Representations: A Theory-Driven Approach

Much of the progress in contemporary NLP has come from learning representations, such as masked language model (MLM) contextual embeddings, that turn challenging problems into simple classification tasks. But how do we quantify and explain…

Computation and Language · Computer Science 2021-09-16 Gregory Yauney , David Mimno

Practical Bayesian Optimization of Machine Learning Algorithms

Machine learning algorithms frequently require careful tuning of model hyperparameters, regularization terms, and optimization parameters. Unfortunately, this tuning is often a "black art" that requires expert experience, unwritten rules of…

Machine Learning · Statistics 2012-08-30 Jasper Snoek , Hugo Larochelle , Ryan P. Adams

SemanticOpt: Towards LLM-Based Semantic Black-Box Optimization

Optimizing an experimental system can be extremely challenging when each experiment is expensive, time-consuming, or difficult to perform. Existing optimizers for expensive black-box problems, such as Bayesian optimization, are typically…

Machine Learning · Computer Science 2026-05-18 Jamison Meindl , Yunsheng Tian , Tony Cui , Veronika Thost , Zhang-Wei Hong , Jie Chen , Wojciech Matusik , Mina Konaković Luković

Scalable Bayesian Optimization Using Deep Neural Networks

Bayesian optimization is an effective methodology for the global optimization of functions with expensive evaluations. It relies on querying a distribution over functions defined by a relatively cheap surrogate model. An accurate model for…

Machine Learning · Statistics 2015-07-14 Jasper Snoek , Oren Rippel , Kevin Swersky , Ryan Kiros , Nadathur Satish , Narayanan Sundaram , Md. Mostofa Ali Patwary , Prabhat , Ryan P. Adams

Language Model Embeddings Can Be Sufficient for Bayesian Optimization

Bayesian Optimization is ubiquitous in experimental design and black-box optimization for improving search efficiency. However, most existing approaches rely on regression models which are limited to fixed search spaces and structured,…

Machine Learning · Computer Science 2025-10-10 Tung Nguyen , Qiuyi Zhang , Bangding Yang , Chansoo Lee , Jorg Bornschein , Yingjie Miao , Sagi Perel , Yutian Chen , Xingyou Song

Rank over Class: The Untapped Potential of Ranking in Natural Language Processing

Text classification has long been a staple within Natural Language Processing (NLP) with applications spanning across diverse areas such as sentiment analysis, recommender systems and spam detection. With such a powerful solution, it is…

Computation and Language · Computer Science 2021-12-06 Amir Atapour-Abarghouei , Stephen Bonner , Andrew Stephen McGough

Joint Optimization of Tokenization and Downstream Model

Since traditional tokenizers are isolated from a downstream task and model, they cannot output an appropriate tokenization depending on the task and model, although recent studies imply that the appropriate tokenization improves the…

Computation and Language · Computer Science 2021-05-27 Tatsuya Hiraoka , Sho Takase , Kei Uchiumi , Atsushi Keyaki , Naoaki Okazaki

Ordinal Bayesian Optimisation

Bayesian optimisation is a powerful tool to solve expensive black-box problems, but fails when the stationary assumption made on the objective function is strongly violated, which is the case in particular for ill-conditioned or…

Machine Learning · Statistics 2019-12-06 Victor Picheny , Sattar Vakili , Artem Artemev

Function Space Particle Optimization for Bayesian Neural Networks

While Bayesian neural networks (BNNs) have drawn increasing attention, their posterior inference remains challenging, due to the high-dimensional and over-parameterized nature. To address this issue, several highly flexible and scalable…

Machine Learning · Statistics 2019-05-10 Ziyu Wang , Tongzheng Ren , Jun Zhu , Bo Zhang

Random Postprocessing for Combinatorial Bayesian Optimization

Model-based sequential approaches to discrete "black-box" optimization, including Bayesian optimization techniques, often access the same points multiple times for a given objective function in interest, resulting in many steps to find the…

Machine Learning · Computer Science 2023-12-29 Keisuke Morita , Yoshihiko Nishikawa , Masayuki Ohzeki

Bayesian Optimization of Bilevel Problems

Bilevel optimization, a hierarchical mathematical framework where one optimization problem is nested within another, has emerged as a powerful tool for modeling complex decision-making processes in various fields such as economics,…

Machine Learning · Computer Science 2024-12-25 Omer Ekmekcioglu , Nursen Aydin , Juergen Branke

Progressive Sampling-Based Bayesian Optimization for Efficient and Automatic Machine Learning Model Selection

Purpose: Machine learning is broadly used for clinical data analysis. Before training a model, a machine learning algorithm must be selected. Also, the values of one or more model parameters termed hyper-parameters must be set. Selecting…

Machine Learning · Computer Science 2018-12-10 Xueqiang Zeng , Gang Luo

Large Scale Multi-Task Bayesian Optimization with Large Language Models

In multi-task Bayesian optimization, the goal is to leverage experience from optimizing existing tasks to improve the efficiency of optimizing new ones. While approaches using multi-task Gaussian processes or deep kernel transfer exist, the…

Machine Learning · Computer Science 2025-06-13 Yimeng Zeng , Natalie Maus , Haydn Thomas Jones , Jeffrey Tao , Fangping Wan , Marcelo Der Torossian Torres , Cesar de la Fuente-Nunez , Ryan Marcus , Osbert Bastani , Jacob R. Gardner

Bayesian Optimization for Selecting Efficient Machine Learning Models

The performance of many machine learning models depends on their hyper-parameter settings. Bayesian Optimization has become a successful tool for hyper-parameter optimization of machine learning algorithms, which aims to identify optimal…

Machine Learning · Computer Science 2020-08-04 Lidan Wang , Franck Dernoncourt , Trung Bui

Bayesian Optimization using Deep Gaussian Processes

Bayesian Optimization using Gaussian Processes is a popular approach to deal with the optimization of expensive black-box functions. However, because of the a priori on the stationarity of the covariance matrix of classic Gaussian…

Machine Learning · Statistics 2019-05-10 Ali Hebbal , Loic Brevault , Mathieu Balesdent , El-Ghazali Talbi , Nouredine Melab

Training LLMs Beyond Next Token Prediction -- Filling the Mutual Information Gap

Optimizing training performance in large language models (LLMs) remains an essential challenge, particularly in improving model performance while maintaining computational costs. This work challenges the conventional approach of training…

Computation and Language · Computer Science 2025-11-04 Chun-Hao Yang , Bo-Han Feng , Tzu-Yuan Lai , Yan Yu Chen , Yin-Kai Dean Huang , Shou-De Lin

Improving Tokenisation by Alternative Treatment of Spaces

Tokenisation is the first step in almost all NLP tasks, and state-of-the-art transformer-based language models all use subword tokenisation algorithms to process input text. Existing algorithms have problems, often producing tokenisations…

Computation and Language · Computer Science 2022-10-25 Edward Gow-Smith , Harish Tayyar Madabushi , Carolina Scarton , Aline Villavicencio

Deep Learning for Explicitly Modeling Optimization Landscapes

In all but the most trivial optimization problems, the structure of the solutions exhibit complex interdependencies between the input parameters. Decades of research with stochastic search techniques has shown the benefit of explicitly…

Neural and Evolutionary Computing · Computer Science 2017-03-23 Shumeet Baluja