Browse by keyword "Reinforcement learning"

Showing items 1-20 of 47

A Hybrid Simulation and Reinforcement Learning Algorithm for Enhancing Efficiency in Warehouse Operations

Leon, Jonas F.; Li, Yuda; Martin, Xabier A.; Calvet, Laura; Panadero, Javier; Juan, Angel A. (MDPI AG, 2023-09)

[EN] The use of simulation and reinforcement learning can be viewed as a flexible approach to aid managerial decision-making, particularly in the face of growing complexity in manufacturing and logistic systems. Efficient ...
A Learnheuristic Algorithm Based on Thompson Sampling for the Heterogeneous and Dynamic Team Orienteering Problem

Uguina, Antonio R.; Gomez, Juan F; Panadero, Javier; Martínez-Gavara, Anna; Juan, Angel A. (MDPI AG, 2024-06)

[EN] The team orienteering problem (TOP) is a well-studied optimization challenge in the field of Operations Research, where multiple vehicles aim to maximize the total collected rewards within a given time limit by visiting ...
A Learnheuristic Algorithm for the Capacitated Dispersion Problem with Dynamic Conditions

Gómez, Juan F.; Uguina, Antonio R.; Panadero, Javier; Juan, Angel A. (MDPI AG, 2023-12)

[EN] The capacitated dispersion problem, which is a variant of the maximum diversity problem, aims to determine a set of elements within a network. These elements could symbolize, for instance, facilities in a supply chain ...
Adaptive spatial discretization using reinforcement learning

Butt, Jemil; Wieser, Andreas (Editorial Universitat Politècnica de València, 2023-01-27)

[EN] A well-known challenge for deformation monitoring is the spatial discretization, i.e. the choice of monitoring points at which measurements are to be taken. Well-chosen monitoring points employ prior knowledge to yield ...
Aplicación de técnicas de aprendizaje por refuerzo para la navegación de robots móviles

Pescador Barreto, Germán Andrés (Universitat Politècnica de València, 2023-10-21)

[ES] This work focuses on training a differential-navigation robotic agent using reinforcement learning techniques. The main objective is to train the agent to perform navigation tasks ...
Aplicación del aprendizaje por refuerzo profundo para el abastecimiento colaborativo entre cadenas de suministro competitivas del sector del calzado

Seni Molina, Mario Jose (Universitat Politècnica de València, 2022-11-03)

[ES] This master's thesis develops an intelligent agent based on deep reinforcement learning to model the collaborative sourcing process between chains of ...
Aprendizaje por refuerzo en logística corporativa: Next Best Action

Obrador Reina, Miquel (Universitat Politècnica de València, 2023-09-22)

[ES] The field of reinforcement learning seeks to train intelligent agents to make optimal decisions in complex situations through interaction with an environment. In this work, carried out ...
Aprendizaje por refuerzo en sistemas multiagente mediante MARLÖ

Martínez Sanchis, Genís (Universitat Politècnica de València, 2021-02-24)

[ES] This bachelor's thesis carries out a study analysing how reinforcement learning algorithms designed for single-agent environments can be applied to multi-agent environments based on the platform ...
Aprendizaje por refuerzo mediante métodos de búsqueda de política en sistemas electromecánicos

Pastor Alcaraz, José Manuel (Universitat Politècnica de València, 2017-02-07)

[EN] The aim of this master's thesis is to study the state of the art of reinforcement learning, particularly methods based on policy search, and to apply such techniques to a 3-DOF inverted pendulum mechanism. The controller ...
Artificial intelligent system for multimedia services in smart home environments

Rego Mañez, Albert; Gonzalez Ramirez, Pedro Luis; Jimenez, Jose M.; Lloret, Jaime (Springer-Verlag, 2022-06)

[EN] Internet of Things (IoT) has introduced new applications and environments. Smart Home provides new ways of communication and service consumption. In addition, Artificial Intelligence (AI) and deep learning have improved ...
Comparing humans and AI agents

Insa Cabrera, Javier; Dowe, David L.; España Cubillo, Sergio; Hernández-Lloreda, M. Victoria; Hernández Orallo, José (Springer Verlag (Germany), 2011)

Comparing humans and machines is one important source of information about both machine and human strengths and limitations. Most of these comparisons and competitions are performed in rather specific tasks such as ...
Data Homogenization Method for Heterogeneous Sensors Applied to Reinforcement Learning

Palacios-Morocho, Maritza Elizabeth; López-Muñoz, Pablo; Costán, Manuel A.; Monserrat del Río, Jose Francisco (Institute of Electrical and Electronics Engineers, 2023)

[EN] In autonomous navigation and route planning, the data obtained by the different sensors play a significant role. On the one hand, more data will lead to faster learning of the behavioral policy. On the other hand, ...
Desarrollo de controladores modulares de posición/fuerza basados en ROS2 para robots paralelos de rehabilitación de miembro inferior

Ferrándiz Alarcón, Jesús (Universitat Politècnica de València, 2021-10-19)

[ES] This work aims to develop controllers using a modular architecture based on free, open-source software. To this end, the control unit will run the operating system ...
Design and Performance Analysis of Access Control Mechanisms for Massive Machine-to-Machine Communications in Wireless Cellular Networks

Tello Oquendo, Luis Patricio (Universitat Politècnica de València, 2018-09-10)

Nowadays, the Internet of Things (IoT) is an essential technology for the next generation of wireless systems. Connectivity is the foundation of IoT, and the type of access required will depend ...
Design Trend Forecasting by Combining Conceptual Analysis and Semantic Projections: New Tools for Open Innovation

Manetti, Alessandro; Ferrer Sapena, Antonia; Sánchez Pérez, Enrique Alfonso; Lara-Navarra, Pablo (MDPI AG, 2021-03-10)

[EN] In this paper, we describe a new trend analysis and forecasting method (Deflexor), which is intended to help inform decisions in almost any field of human social activity, including, for example, business, art and ...
Development and testing of an embedded control system for the levitation of a Hyperloop vehicle using Reinforcement Learning

Albert Bonet, Hugo (Universitat Politècnica de València, 2024-06-26)

[ES] Hyperloop is the so-called transport of the future, a new means of transport that combines levitation and vacuum to avoid friction along its route, which makes it a faster, ...
Digital twin for supply chain master planning in zero-defect manufacturing

Serrano, Julio C.; Mula, Josefa; Poler, R. (Springer, 2021-06-30)

[EN] Recently, many novel paradigms, concepts and technologies, which lay the foundation for the new revolution in manufacturing environments, have emerged and make it faster to address critical decisions today in supply ...
Dreaming machine learning: Lipschitz extensions for reinforcement learning on financial markets

Calabuig, J. M.; Falciani, H.; Sánchez Pérez, Enrique Alfonso (Elsevier, 2020-07-20)

[EN] We consider a quasi-metric topological structure for the construction of a new reinforcement learning model in the framework of financial markets. It is based on a Lipschitz type extension of reward functions defined ...
Enhancing Cooperative Multi-Agent Systems With Self-Advice and Near-Neighbor Priority Collision Control

Palacios-Morocho, Maritza Elizabeth; Inca, Saúl; Monserrat del Río, Jose Francisco (Institute of Electrical and Electronics Engineers, 2024)

[EN] The coordination of actions to be executed by multiple independent agents in a dynamic environment is one of the main challenges of multi-agent systems. To address this type of scenario, a key technology called ...
Enseñanza del aprendizaje por refuerzo con un sencillo ejemplo de minimización de funciones

Arnau Notari, Andres Roger; García Raffi, Luis Miguel; Calabuig Rodriguez, Jose Manuel; Sánchez Pérez, Enrique Alfonso (Editorial Universitat Politècnica de València, 2023-10-06)

[ES] This work presents a workshop-style session aimed at university students so that they understand the fundamentals of reinforcement learning (RL). This artificial intelligence technique is not ...