Diaz, Henry; Sala, Antonio; Armesto Ángel, Leopoldo(De Gruyter Open Sp. z o.o., 2020-06)
[EN] The linear programming (LP) approach to solve the Bellman equation in dynamic programming is a well-known option for
finite state and input spaces to obtain an exact solution. However, with function approximation or ...
Armesto, Leopoldo; Sala, Antonio(Universitat Politècnica de València, 2021-12-17)
[EN] Optimal control and reinforcement learning have an associate “value function” which must be suitably approximated. Value function approximation problems usually have different precision requirements in different regions ...
Díaz Iza, Henry Paúl(Universitat Politècnica de València, 2020-03-23)
[ES] La presente Tesis emplea técnicas de programación dinámica y aprendizaje por refuerzo para el control de sistemas no lineales en espacios discretos y continuos. Inicialmente se realiza una revisión de los conceptos ...