Díaz, Henry; Armesto, Leopoldo; Sala, Antonio (Universitat Politècnica de València, 2019-06-12)
[EN] In this article, we present a methodology for learning data-based approximately optimal controllers, within the context of learning and approximate dynamic programming. There are previous solutions in dynamic programming ...
Armesto, Leopoldo; Sala, Antonio (Universitat Politècnica de València, 2021-12-17)
[EN] Optimal control and reinforcement learning have an associated “value function” which must be suitably approximated. Value function approximation problems usually have different precision requirements in different regions ...