English
Related papers

Related papers: Sparse Feature Selection Makes Batch Reinforcement…

200 papers

Sparse linear regression is a central problem in high-dimensional statistics. We study the correlated random design setting, where the covariates are drawn from a multivariate Gaussian $N(0,\Sigma)$, and we seek an estimator with small…

Data Structures and Algorithms · Computer Science 2023-05-29 Jonathan Kelner , Frederic Koehler , Raghu Meka , Dhruv Rohatgi

This paper studies offline reinforcement learning with linear function approximation in a setting with decision-theoretic, but not estimation sparsity. The structural restrictions of the data-generating process presume that the transitions…

Machine Learning · Statistics 2024-01-24 Angela Zhou

Improving sample efficiency is central to Reinforcement Learning (RL), especially in environments where the rewards are sparse. Some recent approaches have proposed to specify reward functions as manually designed or learned reward…

Machine Learning · Computer Science 2024-01-26 Shuai Han , Mehdi Dastani , Shihan Wang

The recent work by Dong & Yang (2023) showed for misspecified sparse linear bandits, one can obtain an $O\left(\epsilon\right)$-optimal policy using a polynomial number of samples when the sparsity is a constant, where $\epsilon$ is the…

Machine Learning · Computer Science 2024-07-19 Ally Yalei Du , Lin F. Yang , Ruosong Wang

Reinforcement learning (RL) is a key post-pretraining step for aligning large language models (LLMs) with complex tasks and human preferences. While it is often assumed that RL fine-tuning requires updating most of a model's parameters, we…

Machine Learning · Computer Science 2025-07-30 Andrii Balashov

Feature selection plays a crucial role in improving predictive accuracy by identifying relevant features while filtering out irrelevant ones. This study investigates the importance of effective feature selection in enhancing the performance…

Machine Learning · Computer Science 2024-03-12 Younes Ghazagh Jahed , Seyyed Ali Sadat Tavana

Reinforcement learning (RL) has emerged as a promising strategy for finetuning small language models (SLMs) to solve targeted tasks such as math and coding. However, RL algorithms tend to be resource-intensive, taking a significant amount…

Machine Learning · Computer Science 2025-10-07 Lianghuan Huang , Sagnik Anupam , Insup Lee , Shuo Li , Osbert Bastani

In this paper, we review state-of-the-art methods for feature selection in statistics with an application-oriented eye. Indeed, sparsity is a valuable property and the profusion of research on the topic might have provided little guidance…

Methodology · Statistics 2021-11-08 Dimitris Bertsimas , Jean Pauphilet , Bart Van Parys

Reinforcement Learning (RL) has become essential for eliciting complex reasoning capabilities in Large Language Models (LLMs). However, the substantial memory overhead of storing Key-Value (KV) caches during long-horizon rollouts acts as a…

Machine Learning · Computer Science 2026-03-31 Sijia Luo , Xiaokang Zhang , Yuxuan Hu , Bohan Zhang , Ke Wang , Jinbo Su , Mengshu Sun , Lei Liang , Jing Zhang

In model-free deep reinforcement learning (RL) algorithms, using noisy value estimates to supervise policy evaluation and optimization is detrimental to the sample efficiency. As this noise is heteroscedastic, its effects can be mitigated…

Machine Learning · Computer Science 2022-05-04 Vincent Mai , Kaustubh Mani , Liam Paull

In this paper, we investigate the use of small datasets in the context of offline reinforcement learning (RL). While many common offline RL benchmarks employ datasets with over a million data points, many offline RL applications rely on…

Machine Learning · Computer Science 2025-06-30 Samin Yeasar Arnob , Scott Fujimoto , Doina Precup

Reinforcement learning (RL) has emerged as a promising strategy for improving the reasoning capabilities of language models (LMs) in domains such as mathematics and coding. However, most modern RL algorithms were designed to target robotics…

Artificial Intelligence · Computer Science 2025-05-26 Lianghuan Huang , Shuo Li , Sagnik Anupam , Insup Lee , Osbert Bastani

Reliant on too many experiments to learn good actions, current Reinforcement Learning (RL) algorithms have limited applicability in real-world settings, which can be too expensive to allow exploration. We propose an algorithm for batch RL,…

Machine Learning · Computer Science 2021-12-07 Rasool Fakoor , Jonas Mueller , Kavosh Asadi , Pratik Chaudhari , Alexander J. Smola

Offline reinforcement learning (RL) aims to learn a policy that maximizes the expected return using a given static dataset of transitions. However, offline RL faces the distribution shift problem. The policy constraint offline RL method is…

Machine Learning · Computer Science 2025-12-24 Yuanhao Chen , Qi Liu , Pengbin Chen , Zhongjian Qiao , Yanjie Li

In high-dimensional settings, sparse structures are critical for efficiency in term of memory and computation complexity. For a linear system, to find the sparsest solution provided with an over-complete dictionary of features directly is…

Machine Learning · Statistics 2020-07-09 Yiping Jiang , Tianshi Chen

Reinforcement learning (RL) algorithms are often categorized as either on-policy or off-policy depending on whether they use data from a target policy of interest or from a different behavior policy. In this paper, we study a subtle…

Machine Learning · Computer Science 2022-10-12 Rujie Zhong , Duohan Zhang , Lukas Schäfer , Stefano V. Albrecht , Josiah P. Hanna

While reinforcement learning (RL) methods that learn an internal model of the environment have the potential to be more sample efficient than their model-free counterparts, learning to model raw observations from high dimensional sensors…

Machine Learning · Computer Science 2023-06-27 Raj Ghugare , Homanga Bharadhwaj , Benjamin Eysenbach , Sergey Levine , Ruslan Salakhutdinov

Sparsity-inducing penalties are useful tools for variable selection and they are also effective for regression settings where the data are functions. We consider the problem of selecting not only variables but also decision boundaries in…

Methodology · Statistics 2020-06-01 Hidetoshi Matsui

This dissertation investigates how reinforcement learning (RL) methods can be designed to be safe, sample-efficient, and robust. Framed through the unifying perspective of contextual-bandit RL, the work addresses two major application…

Machine Learning · Computer Science 2025-10-20 Shashank Gupta

This paper considers a class of reinforcement learning problems, which involve systems with two types of states: stochastic and pseudo-stochastic. In such systems, stochastic states follow a stochastic transition kernel while the…

Machine Learning · Computer Science 2023-11-09 Honghao Wei , Xin Liu , Weina Wang , Lei Ying
‹ Prev 1 2 3 10 Next ›