arXiv:cs/0306120 [cs.LG]

Reinforcement Learning with Linear Function Approximation and LQ control Converges

Istvan Szita, Andras Lorincz

Published 2003-06-22, updated 2007-03-09 (version 2)

Reinforcement learning is commonly used with function approximation. However, very few positive results are known about the convergence of function-approximation-based RL control algorithms. In this paper we show that TD(0) and Sarsa(0) with linear function approximation are convergent for a simple class of problems in which the system is linear and the costs are quadratic (the LQ control problem). Furthermore, we show that for systems with Gaussian noise and not completely observable states (the LQG problem), these RL algorithms remain convergent if they are combined with Kalman filtering.
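
To make the setting concrete, the sketch below runs TD(0) policy evaluation with a linear function approximator on a one-dimensional LQ problem under a fixed linear policy. It is only an illustration of the problem class described in the abstract, not the authors' algorithm or proof; all constants (dynamics a, b, costs q, r, gain k, discount gamma, step size alpha, noise level) are arbitrary choices made here for the example.

import numpy as np

# Sketch: TD(0) with a linear function approximator on a scalar LQ problem.
# The value of the fixed policy u = -k*x is approximated as V(x) ~ w * x^2,
# i.e. linearly in the single feature phi(x) = x^2.
rng = np.random.default_rng(0)

a, b = 0.9, 1.0        # linear dynamics: x' = a*x + b*u + noise
q, r = 1.0, 0.1        # quadratic cost:  c = q*x^2 + r*u^2
k = 0.5                # fixed stabilizing linear policy: u = -k*x
gamma = 0.95           # discount factor
alpha = 0.01           # TD step size
noise_std = 0.1        # Gaussian process noise (fully observed state here)

w = 0.0                # weight of the feature phi(x) = x^2
x = rng.normal()

for t in range(50_000):
    u = -k * x
    cost = q * x**2 + r * u**2
    x_next = a * x + b * u + noise_std * rng.normal()

    # TD(0) update: delta = c + gamma*V(x') - V(x);  w += alpha * delta * phi(x)
    delta = cost + gamma * w * x_next**2 - w * x**2
    w += alpha * delta * x**2

    x = x_next

# Note: the noise adds a constant offset to the true value function that the
# single quadratic feature cannot represent, so w only approximates the
# quadratic coefficient of the value of the policy u = -k*x.
print("learned quadratic value coefficient w ~=", w)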
