Related papers: Kernel-Based Function Approximation for Average Re…

200 papers

Reinforcement learning (RL) has shown empirical success in various real world settings with complex models and large state-action spaces. The existing analytical results, however, typically focus on settings with a small number of…

Machine Learning · Computer Science 2024-03-15 Sattar Vakili, Julia Olkhovskaya

We consider the regret minimization problem in reinforcement learning (RL) in the episodic setting. In many real-world RL environments, the state and action spaces are continuous or very large. Existing approaches establish regret…

Machine Learning · Computer Science 2022-06-29 Sayak Ray Chowdhury, Rafael Oliveira

Kernel methods, particularly kernel ridge regression (KRR), are time-proven, powerful nonparametric regression techniques known for their rich capacity, analytical simplicity, and computational tractability. The analysis of their predictive…

Statistics Theory · Mathematics 2025-09-23 Xin Bing, Xin He, Chao Wang

Reinforcement Learning (RL) has shown great empirical success in various application domains. The theoretical aspects of the problem have been extensively studied over past decades, particularly under tabular and linear Markov Decision…

Machine Learning · Computer Science 2024-06-24 Sattar Vakili

The classical theory of reinforcement learning (RL) has focused on tabular and linear representations of value functions. Further progress hinges on combining RL with modern function approximators such as kernel functions and deep neural…

Machine Learning · Computer Science 2021-01-01 Zhuoran Yang, Chi Jin, Zhaoran Wang, Mengdi Wang, Michael I. Jordan

One central theme in machine learning is function estimation from sparse and noisy data. An example is supervised learning where the elements of the training set are couples, each containing an input location and an output response. In the…

Machine Learning · Computer Science 2023-10-05 Alberto Giaretta, Mauro Bisiacco, Gianluigi Pillonetto

We study contextual bandits with finitely many actions in which the reward of each arm follows a single-index model with an arm-specific index parameter and an unknown nonparametric link function. We consider a regime in which arms…

Machine Learning · Statistics 2026-03-20 Sakshi Arya, Satarupa Bhattacharjee, Bharath K. Sriperumbudur

We propose a new method for feature learning and function estimation in supervised learning via regularised empirical risk minimisation. Our approach considers functions as expectations of Sobolev functions over all possible one-dimensional…

Machine Learning · Statistics 2025-05-01 Bertille Follain, Francis Bach

We study reinforcement learning (RL) for decision processes with non-Markovian reward, in which high-level knowledge of the task in the form of reward machines is available to the learner. We consider probabilistic reward machines with…

Machine Learning · Computer Science 2024-12-30 Hippolyte Bourel, Anders Jonsson, Odalric-Ambrym Maillard, Chenxiao Ma, Mohammad Sadegh Talebi

Reinforcement Learning (RL) problems are being considered under increasingly more complex structures. While tabular and linear models have been thoroughly explored, the analytical study of RL under nonlinear function approximation,…

Machine Learning · Computer Science 2025-09-12 Aya Kayal, Sattar Vakili, Laura Toni, Alberto Bernacchia

Kernel ridge regression, KRR, is a generalization of linear ridge regression that is non-linear in the data, but linear in the model parameters. Here, we introduce an equivalent formulation of the objective function of KRR, which opens up…

Machine Learning · Statistics 2025-03-10 Oskar Allerbo

Ridgeless regression has garnered attention among researchers, particularly in light of the "Benign Overfitting" phenomenon, where models interpolating noisy samples demonstrate robust generalization. However, kernel ridgeless regression…

Machine Learning · Computer Science 2024-06-04 Fan He, Mingzhen He, Lei Shi, Xiaolin Huang, Johan A. K. Suykens

We provide uniform confidence bands for kernel ridge regression (KRR), a widely used nonparametric regression estimator for nonstandard data such as preferences, sequences, and graphs. Despite the prevalence of these data, e.g., student…

Statistics Theory · Mathematics 2025-08-19 Rahul Singh, Suhas Vijaykumar

As reinforcement learning algorithms are being applied to increasingly complicated and realistic tasks, it is becoming increasingly difficult to solve such problems within a practical time frame. Hence, we focus on a satisficing…

Artificial Intelligence · Computer Science 2025-04-16 Akihiro Tamatsukuri, Tatsuji Takahashi

In reinforcement learning, two objective functions have been developed extensively in the literature: discounted and averaged rewards. The generalization to an entropy-regularized setting has led to improved robustness and exploration for…

Machine Learning · Computer Science 2025-01-20 Jacob Adamczyk, Volodymyr Makarenko, Stas Tiomkin, Rahul V. Kulkarni

Model-based Reinforcement Learning (MBRL) has been widely adapted due to its sample efficiency. However, existing worst-case regret analysis typically requires optimistic planning, which is not realistic in general. In contrast, motivated…

Machine Learning · Statistics 2023-01-10 Lingxiao Wang, Ping Li

Standard regression techniques, while powerful, are often constrained by predefined, differentiable loss functions such as mean squared error. These functions may not fully capture the desired behavior of a system, especially when dealing…

Machine Learning · Computer Science 2025-08-04 Yongchao Huang

Reinforcement learning (RL) algorithms based on high-dimensional function approximation have achieved tremendous empirical success in large-scale problems with an enormous number of states. However, most analysis of such algorithms gives…

Machine Learning · Computer Science 2022-02-17 Jihao Long, Jiequn Han, Weinan E

Reinforcement learning algorithms typically assume rewards to be sampled from light-tailed distributions, such as Gaussian or bounded. However, a wide variety of real-world systems generate rewards that follow heavy-tailed distributions. We…

Machine Learning · Computer Science 2021-02-26 Vincent Zhuang, Yanan Sui

This paper studies the safe reinforcement learning problem formulated as an episodic finite-horizon tabular constrained Markov decision process with an unknown transition kernel and stochastic reward and cost functions. We propose a…

Machine Learning · Computer Science 2024-10-15 Kihyun Yu, Duksang Lee, William Overman, Dabeen Lee