English
Related papers

Related papers: Deviation optimal learning using greedy Q-aggregat…

200 papers

The model averaging problem is to average multiple models to achieve a prediction accuracy not much worse than that of the best single model in terms of mean squared error. It is known that if the models are misspecified, model averaging is…

Statistics Theory · Mathematics 2018-02-28 Dong Dai , Lei Han , Ting Yang , Tong Zhang

We consider the problem of model selection type aggregation in the context of density estimation. We first show that empirical risk minimization is sub-optimal for this problem and it shares this property with the exponential weights…

Statistics Theory · Mathematics 2016-09-29 Pierre C. Bellec

We consider a general supervised learning problem with strongly convex and Lipschitz loss and study the problem of model selection aggregation. In particular, given a finite dictionary functions (learners) together with the prior, we…

Statistics Theory · Mathematics 2014-02-28 Guillaume Lecué , Philippe Rigollet

Greedy-GQ with linear function approximation, originally proposed in \cite{maei2010toward}, is a value-based off-policy algorithm for optimal control in reinforcement learning, and it has a non-linear two timescale structure with the…

Machine Learning · Computer Science 2024-05-03 Yue Wang , Yi Zhou , Shaofeng Zou

Given a dictionary of $M_n$ initial estimates of the unknown true regression function, we aim to construct linearly aggregated estimators that target the best performance among all the linear combinations under a sparse $q$-norm ($0 \leq q…

Statistics Theory · Mathematics 2012-01-16 Zhan Wang , Sandra Paterlini , Frank Gao , Yuhong Yang

Motivated by modern applications such as computerized adaptive testing, sequential rank aggregation, and heterogeneous data source selection, we study the problem of active sequential estimation, which involves adaptively selecting…

Statistics Theory · Mathematics 2024-02-14 Xiaoou Li , Hongru Zhao

This paper studies statistical aggregation procedures in regression setting. A motivating factor is the existence of many different methods of estimation, leading to possibly competing estimators. We consider here three different types of…

Statistics Theory · Mathematics 2007-06-13 Florentina Bunea , Alexandre Tsybakov , Marten Wegkamp

This paper presents a novel systematic methodology to obtain new simple and tight approximations, lower bounds, and upper bounds for the Gaussian Q-function, and functions thereof, in the form of a weighted sum of exponential functions.…

Signal Processing · Electrical Eng. & Systems 2020-12-21 Islam M. Tanash , Taneli Riihonen

The approximation of a discrete probability distribution $\mathbf{t}$ by an $M$-type distribution $\mathbf{p}$ is considered. The approximation error is measured by the informational divergence $\mathbb{D}(\mathbf{t}\Vert\mathbf{p})$, which…

Information Theory · Computer Science 2016-07-28 Bernhard C. Geiger , Georg Böcherer

We study the problem of selecting a subset of vectors from a large set, to obtain the best signal representation over a family of functions. Although greedy methods have been widely used for tackling this problem and many of those have been…

Signal Processing · Electrical Eng. & Systems 2023-05-16 Ehsan Tohidi , Mario Coutino , David Gesbert

We consider the problem of approximating a given element $f$ from a Hilbert space $\mathcal{H}$ by means of greedy algorithms and the application of such procedures to the regression problem in statistical learning theory. We improve on the…

Statistics Theory · Mathematics 2009-09-29 Andrew R. Barron , Albert Cohen , Wolfgang Dahmen , Ronald A. DeVore

The current paper studies the problem of agnostic $Q$-learning with function approximation in deterministic systems where the optimal $Q$-function is approximable by a function in the class $\mathcal{F}$ with approximation error $\delta \ge…

Machine Learning · Computer Science 2020-02-18 Simon S. Du , Jason D. Lee , Gaurav Mahajan , Ruosong Wang

Kernel based methods provide a way to reconstruct potentially high-dimensional functions from meshfree samples, i.e., sampling points and corresponding target values. A crucial ingredient for this to be successful is the distribution of the…

Numerical Analysis · Mathematics 2021-05-19 Tizian Wenzel , Gabriele Santin , Bernard Haasdonk

Delusional bias is a fundamental source of error in approximate Q-learning. To date, the only techniques that explicitly address delusion require comprehensive search using tabular value estimates. In this paper, we develop efficient…

Machine Learning · Computer Science 2020-03-02 Andy Su , Jayden Ooi , Tyler Lu , Dale Schuurmans , Craig Boutilier

Greedy-GQ is an off-policy two timescale algorithm for optimal control in reinforcement learning. This paper develops the first finite-sample analysis for the Greedy-GQ algorithm with linear function approximation under Markovian noise. Our…

Machine Learning · Computer Science 2020-05-21 Yue Wang , Shaofeng Zou

This paper provides a theoretical understanding of Deep Q-Network (DQN) with the $\varepsilon$-greedy exploration in deep reinforcement learning. Despite the tremendous empirical achievement of the DQN, its theoretical characterization…

Machine Learning · Computer Science 2023-10-26 Shuai Zhang , Hongkang Li , Meng Wang , Miao Liu , Pin-Yu Chen , Songtao Lu , Sijia Liu , Keerthiram Murugesan , Subhajit Chaudhury

We introduce an alternative to the notion of `fast rate' in Learning Theory, which coincides with the optimal error rate when the given class happens to be convex and regular in some sense. While it is well known that such a rate cannot…

Statistics Theory · Mathematics 2015-02-26 Shahar Mendelson

Full-sampling (e.g., Q-learning) and pure-expectation (e.g., Expected Sarsa) algorithms are efficient and frequently used techniques in reinforcement learning. Q$(\sigma,\lambda)$ is the first approach unifies them with eligibility trace…

Machine Learning · Computer Science 2019-09-09 Long Yang , Yu Zhang , Qian Zheng , Pengfei Li , Gang Pan

Aggregation methods have emerged as a powerful and flexible framework in statistical learning, providing unified solutions across diverse problems such as regression, classification, and density estimation. In the context of generalized…

Statistics Theory · Mathematics 2025-04-15 The Tien Mai

Ensembles of independently trained neural networks are a state-of-the-art approach to estimate predictive uncertainty in Deep Learning, and can be interpreted as an approximation of the posterior distribution via a mixture of delta…

Machine Learning · Computer Science 2022-07-11 Aleksei Tiulpin , Matthew B. Blaschko
‹ Prev 1 2 3 10 Next ›