Buscar en RiuNet

Listar

Todo RiuNet

Mi cuenta

Acceder

Ayuda RiuNet

Admin. UPV

Listar por autor "Jorge Cano, Javier"

Mostrando ítems 1-16 de 16

Clasificación de vídeos mediante Redes Neuronales Artificiales

Jorge Cano, Javier (Universitat Politècnica de València, 2016-05-27)

[EN] Nowadays, the research on computer vision and machine learning is in its best moment. The computational capacity and communications currently available in any device, have risen new challenges. Among them, the task ...
Doblaje automático de vídeo-charlas educativas en UPV[Media]

Pérez González de Martos, Alejandro Manuel; Giménez Pastor, Adrián; Jorge Cano, Javier; Iranzo Sánchez, Javier; Silvestre Cerdà, Joan Albert; Garcés Díaz-Munío, Gonzalo Vicente; Baquero Arnal, Pau; Sanchis Navarro, José Alberto; Civera Saiz, Jorge; Juan Císcar, Alfonso; Turró Ribalta, Carlos (Editorial Universitat Politècnica de València, 2023-01-09)

[EN] More and more universities are banking on the production of digital contents to support online or blended learning in higher education. Over the last years, the MLLP research group has been working closely with the ...
Empirical Evaluation of Variational Autoencoders for Data Augmentation

Jorge-Cano, Javier; Vieco Pérez, Jesús; Paredes Palacios, Roberto; Sánchez Peiró, Joan Andreu; Benedí Ruiz, José Miguel (ScitePress, 2018-01-29)

Since the beginning of Neural Networks, different mechanisms have been required to provide a sufficient number of examples to avoid overfitting. Data augmentation, the most common one, is focused on the generation of new ...
Entrenamiento Escalable de Modelos de Deep Learning sobre Infraestructuras Cloud

Jorge Cano, Javier (Universitat Politècnica de València, 2019-01-15)

En este Trabajo Final de Máster se pretende desplegar y aprovisionar de forma desatendida y distribuida un conjunto de nodos que lleven a cabo el entrenamiento de un modelo de deep learning (DL). Para este fin, se utilizará ...
Europarl-ASR: A Large Corpus of Parliamentary Debates for Streaming ASR Benchmarking and Speech Data Filtering/Verbatimization

Garcés Díaz-Munío, Gonçal; Silvestre Cerdà, Joan Albert; Jorge-Cano, Javier; Giménez Pastor, Adrián; Iranzo-Sánchez, Javier; Baquero-Arnal, Pau; Roselló, Nahuel; Pérez-González de Martos, Alejandro Manuel; Civera Saiz, Jorge; Sanchis Navarro, José Alberto; Juan, Alfons (International Speech Communication Association (ISCA), 2021-09-03)

[EN] We introduce Europarl-ASR, a large speech and text corpus of parliamentary debates including 1300 hours of transcribed speeches and 70 million tokens of text in English extracted from European Parliament sessions. The ...
Europarl-ST: A Multilingual Corpus for Speech Translation of Parliamentary Debates

Iranzo-Sánchez, Javier; Silvestre Cerdà, Joan Albert; Jorge-Cano, Javier; Roselló, Nahuel; Giménez, Adriá; Sanchis Navarro, José Alberto; Civera Saiz, Jorge; Juan, Alfons (IEEE, 2020-05-08)

[EN] Current research into spoken language translation (SLT), or speech-to-text translation, is often hampered by the lack of specific data resources for this task, as currently available SLT datasets are restricted to a ...
Generación automática de entornos urbanos y su visualización interactiva en 3D

Jorge Cano, Javier (Universitat Politècnica de València, 2015-03-23)

[ES] La actual industria del entretenimiento exige la generación de contenidos de diversa índole, tanto para ofrecer al jugador nuevas experiencias, como para sorprender al espectador con nuevos planteamientos. Dentro ...
Live Streaming Speech Recognition Using Deep Bidirectional LSTM Acoustic Models and Interpolated Language Models

Jorge-Cano, Javier; Giménez Pastor, Adrián; Silvestre Cerdà, Joan Albert; Civera Saiz, Jorge; Sanchis Navarro, José Alberto; Juan, Alfons (Institute of Electrical and Electronics Engineers, 2022)

[EN] Although Long-Short Term Memory (LSTM) networks and deep Transformers are now extensively used in offline ASR, it is unclear how best offline systems can be adapted to work with them under the streaming setup. After ...
MLLP-VRAIN Spanish ASR Systems for the Albayzin-RTVE 2020 Speech-To-Text Challenge

Jorge-Cano, Javier; Giménez Pastor, Adrián; Baquero-Arnal, Pau; Iranzo-Sánchez, Javier; Pérez-González de Martos, Alejandro Manuel; Garcés Díaz-Munío, Gonçal; Silvestre Cerdà, Joan Albert; Civera Saiz, Jorge; Sanchis Navarro, José Alberto; Juan, Alfons (2021-03-25)

[EN] This paper describes the automatic speech recognition (ASR) systems built by the MLLP-VRAIN research group of Universitat Politecnica de València for the Albayzin-RTVE 2020 Speech-to-Text Challenge. The primary system ...
MLLP-VRAIN Spanish ASR Systems for the Albayzín-RTVE 2020 Speech-to-Text Challenge: Extension

Baquero-Arnal, Pau; Jorge-Cano, Javier; Giménez Pastor, Adrián; Iranzo-Sánchez, Javier; Pérez-González de Martos, Alejandro Manuel; Garcés Díaz-Munío, Gonçal; Silvestre Cerdà, Joan Albert; Civera Saiz, Jorge; Sanchis Navarro, José Alberto; Juan, Alfons (MDPI AG, 2022-01)

[EN] This paper describes the automatic speech recognition (ASR) systems built by the MLLP-VRAIN research group of Universitat Politècnica de València for the Albayzín-RTVE 2020 Speech-to-Text Challenge, and includes an ...
MLLP-VRAIN UPV systems for the IWSLT 2022 Simultaneous Speech Translation and Speech-to-Speech Translation tasks

Iranzo-Sánchez, Javier; Jorge-Cano, Javier; Pérez-González de Martos, Alejandro; Giménez, Adrián; Garcés Díaz-Munío, Gonçal; Baquero-Arnal, Pau; Silvestre Cerdà, Joan Albert; Civera Saiz, Jorge; Sanchis Navarro, José Alberto; Juan, Alfons (Association for Computational Linguistics (ACL), 2022-05-27)

[EN] This work describes the participation of the MLLP-VRAIN research group in the two shared tasks of the IWSLT 2022 conference: Simultaneous Speech Translation and Speech-to-Speech Translation. We present our streaming-ready ...
Passive-Aggressive online learning with nonlinear embeddings

Jorge-Cano, Javier; Paredes Palacios, Roberto (Elsevier, 2018)

[EN] Nowadays, there is an increasing demand for machine learning techniques which can deal with problems where the instances are produced as a stream or in real time. In these scenarios, online learning is able to learn ...
Streaming Automatic Speech Recognition with Hybrid Architectures and Deep Neural Network Models

Jorge Cano, Javier (Universitat Politècnica de València, 2022-12-30)

[ES] Durante la última década, los medios de comunicación han experimentado una revolución, alejándose de la televisión convencional hacia las plataformas de contenido bajo demanda. Además, esta revolución no ha cambiado ...
Streaming cascade-based speech translation leveraged by a direct segmentation model

Iranzo-Sánchez, Javier; Jorge-Cano, Javier; Baquero-Arnal, Pau; Silvestre Cerdà, Joan Albert; Giménez Pastor, Adrián; Civera Saiz, Jorge; Sanchis Navarro, José Alberto; Juan, Alfons (Elsevier, 2021-05-31)

[EN] The cascade approach to Speech Translation (ST) is based on a pipeline that concatenates an Automatic Speech Recognition (ASR) system followed by a Machine Translation (MT) system. Nowadays, state-of-the-art ST systems ...
Towards persuasive social recommendation: knowledge model

Palanca Cámara, Javier; Heras Barberá, Stella María; Jorge Cano, Javier; Julian Inglada, Vicente Javier (2015-06)

[EN] The exponential growth of social networks makes fingerprint let by users on the Internet a great source of information, with data about their preferences, needs, goals, profile and social environment. These data ...
Using Graph-Based Models in a Persuasive Social Recommendation System

Palanca Cámara, Javier; Heras Barberá, Stella María; Jorge Cano, Javier; Julian Inglada, Vicente Javier (ACM, 2015-04)

Nowadays, social networks have an enormous impact in the society generating a lot of useful information to be employed in new social applications. In this paper, we show how we have used a graph-based model to extract ...