Related papers: UDO: Universal Database Optimization using Reinfor…

Deep Reinforcement Learning for Online Computation Offloading in Wireless Powered Mobile-Edge Computing Networks

Wireless powered mobile-edge computing (MEC) has recently emerged as a promising paradigm to enhance the data processing capability of low-power networks, such as wireless sensor networks and internet of things (IoT). In this paper, we…

Networking and Internet Architecture · Computer Science · 2020-07-16 · Liang Huang, Suzhi Bi, Ying-Jun Angela Zhang

AURO: Reinforcement Learning for Adaptive User Retention Optimization in Recommender Systems

The field of Reinforcement Learning (RL) has garnered increasing attention for its ability to optimize user retention in recommender systems. A primary obstacle in this optimization process is the environment non-stationarity stemming…

Information Retrieval · Computer Science · 2025-02-27 · Zhenghai Xue, Qingpeng Cai, Bin Yang, Lantao Hu, Peng Jiang, Kun Gai, Bo An

Adaptive Data Optimization: Dynamic Sample Selection with Scaling Laws

The composition of pretraining data is a key determinant of foundation models' performance, but there is no standard guideline for allocating a limited computational budget across different data sources. Most current approaches either rely…

Machine Learning · Computer Science · 2024-10-16 · Yiding Jiang, Allan Zhou, Zhili Feng, Sadhika Malladi, J. Zico Kolter

UDuo: Universal Dual Optimization Framework for Online Matching

Online resource allocation under budget constraints critically depends on proper modeling of user arrival dynamics. Classical approaches employ stochastic user arrival models to derive near-optimal solutions through fractional matching…

Information Retrieval · Computer Science · 2025-05-29 · Bin Li, Diwei Liu, Zehong Hu, Jia Jia

ADORA: Training Reasoning Models with Dynamic Advantage Estimation on Reinforcement Learning

Reinforcement learning has become a cornerstone technique for developing reasoning models in complex tasks, ranging from mathematical problem-solving to imaginary reasoning. The optimization of these models typically relies on policy…

Machine Learning · Computer Science · 2026-02-11 · Qingnan Ren, Shiting Huang, Zhen Fang, Zehui Chen, Lin Chen, Lijun Li, Feng Zhao

Learning to Do or Learning While Doing: Reinforcement Learning and Bayesian Optimisation for Online Continuous Tuning

Online tuning of real-world plants is a complex optimisation problem that continues to require manual intervention by experienced human operators. Autonomous tuning is a rapidly expanding field of research, where learning-based methods,…

Machine Learning · Computer Science · 2023-06-07 · Jan Kaiser, Chenran Xu, Annika Eichler, Andrea Santamaria Garcia, Oliver Stein, Erik Bründermann, Willi Kuropka, Hannes Dinter, Frank Mayet, Thomas Vinatier, Florian Burkart, Holger Schlarb

The Case for Automatic Database Administration using Deep Reinforcement Learning

Like any large software system, a full-fledged DBMS offers an overwhelming amount of configuration knobs. These range from static initialisation parameters like buffer sizes, degree of concurrency, or level of replication to complex runtime…

Databases · Computer Science · 2018-01-18 · Ankur Sharma, Felix Martin Schuhknecht, Jens Dittrich

How to "DODGE" Complex Software Analytics?

Machine learning techniques applied to software engineering tasks can be improved by hyperparameter optimization, i.e., automatic tools that find good settings for a learner's control parameters. We show that such hyperparameter…

Software Engineering · Computer Science · 2019-12-03 · Amritanshu Agrawal, Wei Fu, Di Chen, Xipeng Shen, Tim Menzies

ODDO: Online Duality-Driven Optimization

Motivated by energy management for micro-grids, we study convex optimization problems with uncertainty in the objective function and sequential decision making. To solve these problems, we propose a new framework called "Online…

Optimization and Control · Mathematics · 2020-08-25 · Martijn H. H. Schoot Uiterkamp, Marco E. T. Gerards, Johann L. Hurink

Facilitating Database Tuning with Hyper-Parameter Optimization: A Comprehensive Experimental Evaluation

Recently, using automatic configuration tuning to improve the performance of modern database management systems (DBMSs) has attracted increasing interest from the database community. This is embodied with a number of systems featuring…

Databases · Computer Science · 2022-03-15 · Xinyi Zhang, Zhuo Chang, Yang Li, Hong Wu, Jian Tan, Feifei Li, Bin Cui

Guided Automated Learning for query workload re-Optimization

Query optimization is a hallmark of database systems enabling complex SQL queries of today's applications to be run efficiently. The query optimizer often fails to find the best plan, when logical subtleties in business queries and schemas…

Databases · Computer Science · 2019-05-23 · Guilherme Damasio, Vincent Corvinelli, Parke Godfrey, Piotr Mierzejewski, Alexandar Mihaylov, Jaroslaw Szlichta, Calisto Zuzarte

One-shot, Offline and Production-Scalable PID Optimisation with Deep Reinforcement Learning

Proportional-integral-derivative (PID) control underlies more than 97% of automated industrial processes. Controlling these processes effectively with respect to some specified set of performance goals requires finding an optimal set of…

Systems and Control · Electrical Eng. & Systems · 2022-10-26 · Zacharaya Shabka, Michael Enrico, Nick Parsons, Georgios Zervas

Direct Value Optimization: Improving Chain-of-Thought Reasoning in LLMs with Refined Values

We introduce Direct Value Optimization (DVO), an innovative reinforcement learning framework for enhancing large language models in complex reasoning tasks. Unlike traditional methods relying on preference labels, DVO utilizes value signals…

Computation and Language · Computer Science · 2025-02-20 · Hongbo Zhang, Han Cui, Guangsheng Bao, Linyi Yang, Jun Wang, Yue Zhang

Adaptive Preference Optimization with Uncertainty-aware Utility Anchor

Offline preference optimization methods are efficient for large language model (LLM) alignment. Direct Preference Optimization (DPO)-like learning, one of the most popular approaches, stands out for its efficiency in reward modeling.…

Machine Learning · Computer Science · 2026-05-26 · Xiaobo Wang, Zixia Jia, Jiaqi Li, Qi Liu, Zilong Zheng

DOT: Dynamic Knob Selection and Online Sampling for Automated Database Tuning

Database Management Systems (DBMS) are crucial for efficient data management and access control, but their administration remains challenging for Database Administrators (DBAs). Tuning, in particular, is known to be difficult. Modern…

Databases · Computer Science · 2026-03-17 · Yifan Wang, Debabrota Basu, Pierre Bourhis, Romain Rouvoy, Patrick Royer

Learning-Augmented Decentralized Online Convex Optimization in Networks

This paper studies decentralized online convex optimization in a networked multi-agent system and proposes a novel algorithm, Learning-Augmented Decentralized Online optimization (LADO), for individual agents to select actions only based on…

Machine Learning · Computer Science · 2024-10-21 · Pengfei Li, Jianyi Yang, Adam Wierman, Shaolei Ren

Neo: A Learned Query Optimizer

Query optimization is one of the most challenging problems in database systems. Despite the progress made over the past decades, query optimizers remain extremely complex components that require a great deal of hand-tuning for specific…

Databases · Computer Science · 2020-04-09 · Ryan Marcus, Parimarjan Negi, Hongzi Mao, Chi Zhang, Mohammad Alizadeh, Tim Kraska, Olga Papaemmanouil, Nesime Tatbul

RELOAD: A Robust and Efficient Learned Query Optimizer for Database Systems

Recent advances in query optimization have shifted from traditional rule-based and cost-based techniques towards machine learning-driven approaches. Among these, reinforcement learning (RL) has attracted significant attention due to its…

Databases · Computer Science · 2026-04-17 · Seokwon Lee, Jaeyoung Sim, Sihyun Kim, Yuhsing Li, Yiwen Zhu, Kwanghyun Park

Improving Optimization Bounds using Machine Learning: Decision Diagrams meet Deep Reinforcement Learning

Finding tight bounds on the optimal solution is a critical element of practical solution methods for discrete optimization problems. In the last decade, decision diagrams (DDs) have brought a new perspective on obtaining upper and lower…

Artificial Intelligence · Computer Science · 2019-02-28 · Quentin Cappart, Emmanuel Goutierre, David Bergman, Louis-Martin Rousseau

Amortized Proximal Optimization

We propose a framework for online meta-optimization of parameters that govern optimization, called Amortized Proximal Optimization (APO). We first interpret various existing neural network optimizers as approximate stochastic proximal point…

Machine Learning · Computer Science · 2022-03-02 · Juhan Bae, Paul Vicol, Jeff Z. HaoChen, Roger Grosse