Related papers: Adaptive Discretization for Model-Based Reinforcem…

Adaptive Discretization for Episodic Reinforcement Learning in Metric Spaces

We present an efficient algorithm for model-free episodic reinforcement learning on large (potentially continuous) state-action spaces. Our algorithm is based on a novel $Q$-learning policy with adaptive data-driven discretization. The…

Machine Learning · Computer Science 2019-12-20 Sean R. Sinclair, Siddhartha Banerjee, Christina Lee Yu

Adaptive Discretization in Online Reinforcement Learning

Discretization based approaches to solving online reinforcement learning problems have been studied extensively in practice on applications ranging from resource allocation to cache management. Two major questions in designing…

Machine Learning · Statistics 2024-09-30 Sean R. Sinclair, Siddhartha Banerjee, Christina Lee Yu

Adaptive Partitioning and Learning for Stochastic Control of Diffusion Processes

We study reinforcement learning for controlled diffusion processes with unbounded continuous state spaces, bounded continuous actions, and polynomially growing rewards: settings that arise naturally in finance, economics, and operations…

Machine Learning · Computer Science 2025-12-18 Hanqing Jin, Renyuan Xu, Yanzhao Yang

An adaptive discretization algorithm for locally optimal experimental design with constraints

We develop a novel iterative algorithm for locally optimal experimental design under constraints, like budget or performance constraints. It is an adaptive discretization algorithm. In every iteration, a discretized version of the…

Optimization and Control · Mathematics 2026-04-21 Jochen Schmid, Philipp Seufert, Jan Schwientek, Tobias Seidel, Karl-Heinz Küfer

Improving Adaptive Online Learning Using Refined Discretization

We study unconstrained Online Linear Optimization with Lipschitz losses. Motivated by the pursuit of instance optimality, we propose a new algorithm that simultaneously achieves ($i$) the AdaGrad-style second order gradient adaptivity; and…

Machine Learning · Computer Science 2024-02-23 Zhiyu Zhang, Heng Yang, Ashok Cutkosky, Ioannis Ch. Paschalidis

Unsupervised Learning of Effective Actions in Robotics

Learning actions that are relevant to decision-making and can be executed effectively is a key problem in autonomous robotics. Current state-of-the-art action representations in robotics lack proper effect-driven learning of the robot's…

Robotics · Computer Science 2024-04-04 Marko Zaric, Jakob Hollenstein, Justus Piater, Erwan Renaudo

Adaptive discretization algorithms for locally optimal experimental design

We develop adaptive discretization algorithms for locally optimal experimental design of nonlinear prediction models. With these algorithms, we refine and improve a pertinent state-of-the-art algorithm in various respects. We establish…

Optimization and Control · Mathematics 2024-06-04 Jochen Schmid, Philipp Seufert, Michael Bortz

Zooming for Efficient Model-Free Reinforcement Learning in Metric Spaces

Despite the wealth of research into provably efficient reinforcement learning algorithms, most works focus on tabular representation and thus struggle to handle exponentially or infinitely large state-action spaces. In this paper, we…

Machine Learning · Computer Science 2020-03-10 Ahmed Touati, Adrien Ali Taiga, Marc G. Bellemare

Tighter Problem-Dependent Regret Bounds in Reinforcement Learning without Domain Knowledge using Value Function Bounds

Strong worst-case performance bounds for episodic reinforcement learning exist but fortunately in practice RL algorithms perform much better than such bounds would predict. Algorithms and theory that provide strong problem-dependent bounds…

Machine Learning · Computer Science 2019-11-05 Andrea Zanette, Emma Brunskill

The Best of Both Worlds: Reinforcement Learning with Logarithmic Regret and Policy Switches

In this paper, we study the problem of regret minimization for episodic Reinforcement Learning (RL) both in the model-free and the model-based setting. We focus on learning with general function classes and general model classes, and we…

Machine Learning · Computer Science 2022-03-04 Grigoris Velegkas, Zhuoran Yang, Amin Karbasi

Adaptive Discretization against an Adversary: Lipschitz bandits, Dynamic Pricing, and Auction Tuning

Lipschitz bandits is a prominent version of multi-armed bandits that studies large, structured action spaces such as the $[0,1]$ interval, where similar actions are guaranteed to have similar rewards. A central theme here is the adaptive…

Machine Learning · Computer Science 2025-06-13 Chara Podimata, Aleksandrs Slivkins

Episodic Self-Imitation Learning with Hindsight

Episodic self-imitation learning, a novel self-imitation algorithm with a trajectory selection module and an adaptive loss function, is proposed to speed up reinforcement learning. Compared to the original self-imitation learning algorithm,…

Artificial Intelligence · Computer Science 2020-11-30 Tianhong Dai, Hengyan Liu, Anil Anthony Bharath

Deep Reinforcement Learning for Adaptive Mesh Refinement

Finite element discretizations of problems in computational physics often rely on adaptive mesh refinement (AMR) to preferentially resolve regions containing important features during simulation. However, these spatial refinement strategies…

Computational Engineering, Finance, and Science · Computer Science 2022-09-27 Corbin Foucart, Aaron Charous, Pierre F. J. Lermusiaux

Accelerating Optimization via Adaptive Prediction

We present a powerful general framework for designing data-dependent optimization algorithms, building upon and unifying recent techniques in adaptive regularization, optimistic gradient predictions, and problem-dependent randomization. We…

Machine Learning · Statistics 2015-10-14 Mehryar Mohri, Scott Yang

Adaptive Regularization via Residual Smoothing in Deep Learning Optimization

We present an adaptive regularization algorithm that can be effectively applied to the optimization problem in deep learning framework. Our regularization algorithm aims to take into account the fitness of data to the current state of model…

Machine Learning · Computer Science 2019-09-02 Junghee Cho, Junseok Kwon, Byung-Woo Hong

On adaptive discretization schemes for the regularization of ill-posed problems with semiiterative methods

In this paper we investigate an adaptive discretization strategy for ill-posed linear problems combined with a regularization from a class of semiiterative methods. We show that such a discretization approach in combination with a…

Numerical Analysis · Mathematics 2014-07-22 Wolfgang Erb, Evgeniya V. Semenova

Efficient Exploration in Continuous-time Model-based Reinforcement Learning

Reinforcement learning algorithms typically consider discrete-time dynamics, even though the underlying systems are often continuous in time. In this paper, we introduce a model-based reinforcement learning algorithm that represents…

Machine Learning · Computer Science 2023-11-01 Lenart Treven, Jonas Hübotter, Bhavya Sukhija, Florian Dörfler, Andreas Krause

An Improved Algorithm for Learning Drifting Discrete Distributions

We present a new adaptive algorithm for learning discrete distributions under distribution drift. In this setting, we observe a sequence of independent samples from a discrete distribution that is changing over time, and the goal is to…

Machine Learning · Computer Science 2024-03-11 Alessio Mazzetto

Depth-Adaptive Neural Networks from the Optimal Control viewpoint

In recent years, deep learning has been connected with optimal control as a way to define a notion of a continuous underlying learning problem. In this view, neural networks can be interpreted as a discretization of a parametric Ordinary…

Optimization and Control · Mathematics 2020-07-07 Joubine Aghili, Olga Mula

Model-Based Episodic Memory Induces Dynamic Hybrid Controls

Episodic control enables sample efficiency in reinforcement learning by recalling past experiences from an episodic memory. We propose a new model-based episodic memory of trajectories addressing current limitations of episodic control. Our…

Machine Learning · Computer Science 2021-11-09 Hung Le, Thommen Karimpanal George, Majid Abdolshah, Truyen Tran, Svetha Venkatesh