English
Related papers

Related papers: Adaptive Discretization for Model-Based Reinforcem…

200 papers

We present an efficient algorithm for model-free episodic reinforcement learning on large (potentially continuous) state-action spaces. Our algorithm is based on a novel $Q$-learning policy with adaptive data-driven discretization. The…

Machine Learning · Computer Science 2019-12-20 Sean R. Sinclair , Siddhartha Banerjee , Christina Lee Yu

Discretization based approaches to solving online reinforcement learning problems have been studied extensively in practice on applications ranging from resource allocation to cache management. Two major questions in designing…

Machine Learning · Statistics 2024-09-30 Sean R. Sinclair , Siddhartha Banerjee , Christina Lee Yu

We study reinforcement learning for controlled diffusion processes with unbounded continuous state spaces, bounded continuous actions, and polynomially growing rewards: settings that arise naturally in finance, economics, and operations…

Machine Learning · Computer Science 2025-12-18 Hanqing Jin , Renyuan Xu , Yanzhao Yang

We develop a novel iterative algorithm for locally optimal experimental design under constraints, like budget or performance constraints. It is an adaptive discretization algorithm. In every iteration, a discretized version of the…

Optimization and Control · Mathematics 2026-04-21 Jochen Schmid , Philipp Seufert , Jan Schwientek , Tobias Seidel , Karl-Heinz Küfer

We study unconstrained Online Linear Optimization with Lipschitz losses. Motivated by the pursuit of instance optimality, we propose a new algorithm that simultaneously achieves ($i$) the AdaGrad-style second order gradient adaptivity; and…

Machine Learning · Computer Science 2024-02-23 Zhiyu Zhang , Heng Yang , Ashok Cutkosky , Ioannis Ch. Paschalidis

Learning actions that are relevant to decision-making and can be executed effectively is a key problem in autonomous robotics. Current state-of-the-art action representations in robotics lack proper effect-driven learning of the robot's…

Robotics · Computer Science 2024-04-04 Marko Zaric , Jakob Hollenstein , Justus Piater , Erwan Renaudo

We develop adaptive discretization algorithms for locally optimal experimental design of nonlinear prediction models. With these algorithms, we refine and improve a pertinent state-of-the-art algorithm in various respects. We establish…

Optimization and Control · Mathematics 2024-06-04 Jochen Schmid , Philipp Seufert , Michael Bortz

Despite the wealth of research into provably efficient reinforcement learning algorithms, most works focus on tabular representation and thus struggle to handle exponentially or infinitely large state-action spaces. In this paper, we…

Machine Learning · Computer Science 2020-03-10 Ahmed Touati , Adrien Ali Taiga , Marc G. Bellemare

Strong worst-case performance bounds for episodic reinforcement learning exist but fortunately in practice RL algorithms perform much better than such bounds would predict. Algorithms and theory that provide strong problem-dependent bounds…

Machine Learning · Computer Science 2019-11-05 Andrea Zanette , Emma Brunskill

In this paper, we study the problem of regret minimization for episodic Reinforcement Learning (RL) both in the model-free and the model-based setting. We focus on learning with general function classes and general model classes, and we…

Machine Learning · Computer Science 2022-03-04 Grigoris Velegkas , Zhuoran Yang , Amin Karbasi

Lipschitz bandits is a prominent version of multi-armed bandits that studies large, structured action spaces such as the $[0,1]$ interval, where similar actions are guaranteed to have similar rewards. A central theme here is the adaptive…

Machine Learning · Computer Science 2025-06-13 Chara Podimata , Aleksandrs Slivkins

Episodic self-imitation learning, a novel self-imitation algorithm with a trajectory selection module and an adaptive loss function, is proposed to speed up reinforcement learning. Compared to the original self-imitation learning algorithm,…

Artificial Intelligence · Computer Science 2020-11-30 Tianhong Dai , Hengyan Liu , Anil Anthony Bharath

Finite element discretizations of problems in computational physics often rely on adaptive mesh refinement (AMR) to preferentially resolve regions containing important features during simulation. However, these spatial refinement strategies…

Computational Engineering, Finance, and Science · Computer Science 2022-09-27 Corbin Foucart , Aaron Charous , Pierre F. J. Lermusiaux

We present a powerful general framework for designing data-dependent optimization algorithms, building upon and unifying recent techniques in adaptive regularization, optimistic gradient predictions, and problem-dependent randomization. We…

Machine Learning · Statistics 2015-10-14 Mehryar Mohri , Scott Yang

We present an adaptive regularization algorithm that can be effectively applied to the optimization problem in deep learning framework. Our regularization algorithm aims to take into account the fitness of data to the current state of model…

Machine Learning · Computer Science 2019-09-02 Junghee Cho , Junseok Kwon , Byung-Woo Hong

In this paper we investigate an adaptive discretization strategy for ill-posed linear prob- lems combined with a regularization from a class of semiiterative methods. We show that such a discretization approach in combination with a…

Numerical Analysis · Mathematics 2014-07-22 Wolfgang Erb , Evgeniya V. Semenova

Reinforcement learning algorithms typically consider discrete-time dynamics, even though the underlying systems are often continuous in time. In this paper, we introduce a model-based reinforcement learning algorithm that represents…

Machine Learning · Computer Science 2023-11-01 Lenart Treven , Jonas Hübotter , Bhavya Sukhija , Florian Dörfler , Andreas Krause

We present a new adaptive algorithm for learning discrete distributions under distribution drift. In this setting, we observe a sequence of independent samples from a discrete distribution that is changing over time, and the goal is to…

Machine Learning · Computer Science 2024-03-11 Alessio Mazzetto

In recent years, deep learning has been connected with optimal control as a way to define a notion of a continuous underlying learning problem. In this view, neural networks can be interpreted as a discretization of a parametric Ordinary…

Optimization and Control · Mathematics 2020-07-07 Joubine Aghili , Olga Mula

Episodic control enables sample efficiency in reinforcement learning by recalling past experiences from an episodic memory. We propose a new model-based episodic memory of trajectories addressing current limitations of episodic control. Our…

Machine Learning · Computer Science 2021-11-09 Hung Le , Thommen Karimpanal George , Majid Abdolshah , Truyen Tran , Svetha Venkatesh
‹ Prev 1 2 3 10 Next ›