Related papers: Adaptive Data Optimization: Dynamic Sample Selecti…

200 papers

Modern data workflows are inherently adaptive, repeatedly querying the same dataset to refine and validate sequential decisions, but such adaptivity can lead to overfitting and invalid statistical inference. Adaptive Data Analysis (ADA)…

Machine Learning · Computer Science 2026-02-10 Joon Suk Huh

Adaptive design optimization (ADO) is a state-of-the-art technique for experimental design (Cavagnaro, Myung, Pitt, & Kujala, 2010). ADO dynamically identifies stimuli that, in expectation, yield the most information about a hypothetical…

Applications · Statistics 2024-07-10 Sabina J. Sloman, Daniel Cavagnaro, Stephen B. Broomell

To train machine learning models that are robust to distribution shifts in the data, distributionally robust optimization (DRO) has been proven very effective. However, the existing approaches to learning a distributionally robust model…

Machine Learning · Computer Science 2022-03-21 Farzin Haddadpour, Mohammad Mahdi Kamani, Mehrdad Mahdavi, Amin Karbasi

Stochastic Optimization (SO) is a classical approach for optimization under uncertainty that typically requires knowledge about the probability distribution of uncertain parameters. As the latter is often unknown, Distributionally Robust…

We introduce a framework for optimizing domain-specific dataset construction in foundation model training. Specifically, we seek a cost-efficient way to estimate the quality of data sources (e.g. synthetically generated or filtered web…

While traditional Deep Learning (DL) optimization methods treat all training samples equally, Distributionally Robust Optimization (DRO) adaptively assigns importance weights to different samples. However, a significant gap exists between…

We propose a framework for online meta-optimization of parameters that govern optimization, called Amortized Proximal Optimization (APO). We first interpret various existing neural network optimizers as approximate stochastic proximal point…

Machine Learning · Computer Science 2022-03-02 Juhan Bae, Paul Vicol, Jeff Z. HaoChen, Roger Grosse

Large foundation models are typically trained on data from multiple domains, with the data mixture--the proportion of each domain used--playing a critical role in model performance. The standard approach to selecting this mixture relies on…

Machine Learning · Computer Science 2025-10-03 Mustafa Shukor, Louis Bethune, Dan Busbridge, David Grangier, Enrico Fini, Alaaeldin El-Nouby, Pierre Ablin

The calibration of simulators for complex social systems aims to identify the optimal parameter that drives the output of the simulator best matching the target data observed from the system. As many social systems may change internally…

Neural and Evolutionary Computing · Computer Science 2026-01-28 Peng Yang, Zhenhua Yang, Boquan Jiang, Chenkai Wang, Ke Tang, Xin Yao

This paper considers a class of real-time stochastic optimization problems dependent on an unknown probability distribution. In the considered scenario, data is streaming frequently while trying to reach a decision. Thus, we aim to devise a…

Optimization and Control · Mathematics 2020-09-08 Dan Li, Sonia Martinez

We describe a framework for deriving and analyzing online optimization algorithms that incorporate adaptive, data-dependent regularization, also termed preconditioning. Such algorithms have been proven useful in stochastic optimization by…

Machine Learning · Computer Science 2017-06-21 Vineet Gupta, Tomer Koren, Yoram Singer

In this paper, we propose a practical online method for solving a class of distributionally robust optimization (DRO) with non-convex objectives, which has important applications in machine learning for improving the robustness of neural…

Machine Learning · Computer Science 2021-11-15 Qi Qi, Zhishuai Guo, Yi Xu, Rong Jin, Tianbao Yang

In the past few years, Online Convex Optimization (OCO) has received notable attention in the control literature thanks to its flexible real-time nature and powerful performance guarantees. In this paper, we propose new step-size rules and…

Optimization and Control · Mathematics 2023-01-18 Pedro Zattoni Scroccaro, Arman Sharifi Kolarijani, Peyman Mohajerin Esfahani

In recent years, the state-of-the-art in deep learning has been dominated by very large models that have been pre-trained on vast amounts of data. The paradigm is very simple: investing more computational resources (optimally) leads to…

Machine Learning · Computer Science 2024-05-24 Sotiris Anagnostidis, Gregor Bachmann, Imanol Schlag, Thomas Hofmann

Reinforcement learning has become a cornerstone technique for developing reasoning models in complex tasks, ranging from mathematical problem-solving to imaginary reasoning. The optimization of these models typically relies on policy…

Machine Learning · Computer Science 2026-02-11 Qingnan Ren, Shiting Huang, Zhen Fang, Zehui Chen, Lin Chen, Lijun Li, Feng Zhao

Large neural networks pretrained on web-scale corpora are central to modern machine learning. In this paradigm, the distribution of the large, heterogeneous pretraining data rarely matches that of the application domain. This work considers…

Machine Learning · Computer Science 2023-11-21 David Grangier, Pierre Ablin, Awni Hannun

Large time series models (LTMs) have emerged as powerful tools for universal forecasting, yet they often struggle with the inherent diversity and nonstationarity of real-world time series data, leading to an unsatisfactory trade-off between…

Machine Learning · Computer Science 2026-03-03 Yunzhong Qiu, Zhiyao Cen, Zhongyi Pei, Chen Wang, Jianmin Wang

Forecast-then-optimize is a widely-used framework for decision-making problems in power systems. Traditionally, statistical losses have been employed to train forecasting models, but recent research demonstrated that improved decision…

Systems and Control · Electrical Eng. & Systems 2023-12-22 Haipeng Zhang, Ran Li, Mingyang Sun, Teng Fei

Deep neural networks are typically trained by uniformly sampling large datasets across epochs, despite evidence that not all samples contribute equally throughout learning. Recent work shows that progressively reducing the amount of…

Machine Learning · Computer Science 2026-04-15 Amar Gahir, Varshil Patel, Shreyank N Gowda

Machine learning systems based on minimizing average error have been shown to perform inconsistently across notable subsets of the data, which is not exposed by a low average error for the entire dataset. In consequential social and…

Machine Learning · Computer Science 2021-06-18 Agnieszka Słowik, Léon Bottou