Related papers

Related papers: Efficient Reinforcement Learning Using Recursive L…

200 papers

As an important algorithm in deep reinforcement learning, advantage actor critic (A2C) has been widely successful in both discrete and continuous control tasks with raw pixel inputs, but its sample efficiency still needs improvement. In…

Machine Learning · Computer Science 2022-02-15 Yuan Wang, Chunyuan Zhang, Tianzong Yu, Meng Ma

Recursive least squares (RLS) algorithms were once widely used for training small-scale neural networks, due to their fast convergence. However, previous RLS algorithms are unsuitable for training deep neural networks (DNNs), since they…

Machine Learning · Computer Science 2021-09-08 Chunyuan Zhang, Qi Song, Hui Zhou, Yigui Ou, Hongyao Deng, Laurence Tianruo Yang

This paper investigates the optimality analysis of the recursive least-squares (RLS) algorithm for autoregressive systems with exogenous inputs (ARX systems). A key challenge in the analysis is managing the potential unboundedness of the…

Optimization and Control · Mathematics 2025-05-27 Xingrui Liu, Jieming Ke, Yanlong Zhao

Temporal Difference learning or TD($\lambda$) is a fundamental algorithm in the field of reinforcement learning. However, setting TD's $\lambda$ parameter, which controls the timescale of TD updates, is generally left up to the…

Machine Learning · Computer Science 2017-01-02 Timothy A. Mann, Hugo Penedones, Shie Mannor, Todd Hester

New recursive least squares algorithms with rank two updates (RLSR2) that include both exponential and instantaneous forgetting (implemented via a proper choice of the forgetting factor and the window size) are introduced and systematically…

Optimization and Control · Mathematics 2025-07-16 Alexander Stotsky

This work develops robust diffusion recursive least squares algorithms to mitigate the performance degradation often experienced in networks of agents in the presence of impulsive noise. The first algorithm minimizes an exponentially…

Machine Learning · Computer Science 2019-02-05 Y. Yu, H. Zhao, R. C. de Lamare, Y. Zakharov, L. Lu

Reinforcement learning (RL) has been successfully used to solve many continuous control tasks. Despite its impressive results, however, fundamental questions regarding the sample complexity of RL on continuous problems remain open. We study…

Machine Learning · Computer Science 2017-12-27 Stephen Tu, Benjamin Recht

Performance analysis of the $l_0$-norm-constrained recursive least squares (RLS) algorithm is attempted in this paper. Though its performance is quite attractive compared to its various alternatives, no thorough study of theoretical analysis has…

Information Theory · Computer Science 2016-02-11 Samrat Mukhopadhyay, Bijit Kumar Das, Mrityunjoy Chakraborty

The recursive least-squares (RLS) algorithm has well-documented merits for reducing complexity and storage requirements, when it comes to online estimation of stationary signals as well as for tracking slowly-varying nonstationary…

Networking and Internet Architecture · Computer Science 2013-10-01 Gonzalo Mateos, Georgios B. Giannakis

This paper presents novel adaptive space-time reduced-rank interference suppression least squares algorithms based on joint iterative optimization of parameter vectors. The proposed space-time reduced-rank scheme consists of a joint…

Information Theory · Computer Science 2013-01-15 Rodrigo C. de Lamare, Raimundo Sampaio-Neto

Temporal difference (TD) methods constitute a class of methods for learning predictions in multi-step prediction problems, parameterized by a recency factor lambda. Currently the most important application of these methods is to temporal…

Artificial Intelligence · Computer Science 2008-02-03 P. Cichosz

This paper presents novel adaptive reduced-rank filtering algorithms based on joint iterative optimization of adaptive filters. The novel scheme consists of a joint iterative optimization of a bank of full-rank adaptive filters that…

Information Theory · Computer Science 2013-04-30 Rodrigo C. de Lamare, Raimundo Sampaio-Neto

The approximation of tensors has important applications in various disciplines, but it remains an extremely challenging task. It is well known that tensors of higher order can fail to have best low-rank approximations, but with an important…

Numerical Analysis · Mathematics 2015-03-19 Mike Espig, Aram Khachatryan

The extremum seeking (ES) optimization approach has been very popular due to its non-model-based analysis and implementation. This approach has been mostly used with gradient-based search algorithms. Since least squares (LS) algorithms are…

Systems and Control · Electrical Eng. & Systems 2020-03-10 Nursefa Zengin, Baris Fidan

Least trimmed squares (LTS) is a reasonable formulation of robust regression, but it suffers from high computational cost due to the nonconvexity and nonsmoothness of its objective function. The most frequently used FAST-LTS…

Computation · Statistics 2024-10-08 Shotaro Yagishita

This paper introduces two novel approaches for Online Multi-Task Learning (MTL) Regression Problems. We employ a high performance graph-based MTL formulation and develop two alternative recursive versions based on the Weighted Recursive…

Machine Learning · Statistics 2024-03-19 Gabriel R. Lencione, Fernando J. Von Zuben

The goal of this paper is to propose novel strategies for adaptive learning of signals defined over graphs, which are observed over a (randomly time-varying) subset of vertices. We recast two classical adaptive algorithms in the graph…

Machine Learning · Computer Science 2018-08-01 Paolo Di Lorenzo, Paolo Banelli, Elvin Isufi, Sergio Barbarossa, Geert Leus

Instead of minimizing the sum of all $n$ squared residuals as classical least squares (LS) does, Rousseeuw (1984) proposed minimizing the sum of the $h$ ($n/2 \leq h < n$) smallest squared residuals; the resulting estimator is called least…

Computation · Statistics 2022-10-13 Yijun Zuo

We develop a recursive total least-squares (RTLS) algorithm for errors-in-variables system identification utilizing the inverse power method and the dichotomous coordinate-descent (DCD) iterations. The proposed algorithm, called DCD-RTLS,…

Systems and Control · Computer Science 2015-06-22 Reza Arablouei, Kutluyıl Doğançay, Stefan Werner

The low multilinear rank approximation, also known as the truncated Tucker decomposition, has been extensively utilized in many applications that involve higher-order tensors. Popular methods for low multilinear rank approximation usually…

Numerical Analysis · Mathematics 2021-04-05 Chuanfu Xiao, Chao Yang, Min Li