Jorge Cano, Javier(Universitat Politècnica de València, 2016-05-27)
[EN] Nowadays, research on computer vision and machine learning is at its peak.
The computational capacity and communications currently available in any device have
raised new challenges. Among them, the task ...
Pérez González de Martos, Alejandro Manuel; Giménez Pastor, Adrián; Jorge Cano, Javier; Iranzo Sánchez, Javier; Silvestre Cerdà, Joan Albert; Garcés Díaz-Munío, Gonzalo Vicente; Baquero Arnal, Pau; Sanchis Navarro, José Alberto; Civera Saiz, Jorge; Juan Císcar, Alfonso; Turró Ribalta, Carlos(Editorial Universitat Politècnica de València, 2023-01-09)
[EN] More and more universities are banking on the production of digital content to support
online or blended learning in higher education. Over the last few years, the MLLP research
group has been working closely with the ...
Jorge-Cano, Javier; Vieco Pérez, Jesús; Paredes Palacios, Roberto; Sánchez Peiró, Joan Andreu; Benedí Ruiz, José Miguel(ScitePress, 2018-01-29)
Since the beginning of Neural Networks, different mechanisms have been required to provide a sufficient number of examples to avoid overfitting. Data augmentation, the most common one, is focused on the generation of new ...
Jorge Cano, Javier(Universitat Politècnica de València, 2019-01-15)
This Master's thesis aims to deploy and provision, in an unattended and distributed manner, a set of nodes that carry out the training of a deep learning (DL) model. To this end, we will use ...
Garcés Díaz-Munío, Gonçal; Silvestre Cerdà, Joan Albert; Jorge-Cano, Javier; Giménez Pastor, Adrián; Iranzo-Sánchez, Javier; Baquero-Arnal, Pau; Roselló, Nahuel; Pérez-González de Martos, Alejandro Manuel; Civera Saiz, Jorge; Sanchis Navarro, José Alberto; Juan, Alfons(International Speech Communication Association (ISCA), 2021-09-03)
[EN] We introduce Europarl-ASR, a large speech and text corpus of parliamentary debates including 1300 hours of transcribed speeches and 70 million tokens of text in English extracted from European Parliament sessions. The ...
[EN] Current research into spoken language translation (SLT), or speech-to-text translation, is often hampered by the lack of specific data resources for this task, as currently available SLT datasets are restricted to a ...
Jorge Cano, Javier(Universitat Politècnica de València, 2015-03-23)
[ES] Today's entertainment industry demands the generation of content
of many kinds, both to offer players new experiences and
to surprise viewers with new approaches. Within ...
Jorge-Cano, Javier; Giménez Pastor, Adrián; Silvestre Cerdà, Joan Albert; Civera Saiz, Jorge; Sanchis Navarro, José Alberto; Juan, Alfons(Institute of Electrical and Electronics Engineers, 2022)
[EN] Although Long Short-Term Memory (LSTM) networks and deep Transformers are now extensively used in offline ASR, it is unclear how best offline systems can be adapted to work with them under the streaming setup. After ...
Jorge-Cano, Javier; Giménez Pastor, Adrián; Baquero-Arnal, Pau; Iranzo-Sánchez, Javier; Pérez-González de Martos, Alejandro Manuel; Garcés Díaz-Munío, Gonçal; Silvestre Cerdà, Joan Albert; Civera Saiz, Jorge; Sanchis Navarro, José Alberto; Juan, Alfons(2021-03-25)
[EN] This paper describes the automatic speech recognition (ASR) systems built by the MLLP-VRAIN research group of Universitat Politècnica de València for the Albayzín-RTVE 2020 Speech-to-Text Challenge.
The primary system ...
[EN] This paper describes the automatic speech recognition (ASR) systems built by the MLLP-VRAIN research group of Universitat Politècnica de València for the Albayzín-RTVE 2020 Speech-to-Text Challenge, and includes an ...
Iranzo-Sánchez, Javier; Jorge-Cano, Javier; Pérez-González de Martos, Alejandro; Giménez, Adrián; Garcés Díaz-Munío, Gonçal; Baquero-Arnal, Pau; Silvestre Cerdà, Joan Albert; Civera Saiz, Jorge; Sanchis Navarro, José Alberto; Juan, Alfons(Association for Computational Linguistics (ACL), 2022-05-27)
[EN] This work describes the participation of the MLLP-VRAIN research group in the two shared tasks of the IWSLT 2022 conference: Simultaneous Speech Translation and Speech-to-Speech Translation. We present our streaming-ready ...
[EN] Nowadays, there is an increasing demand for machine learning techniques which can deal with problems where the instances are produced as a stream or in real time. In these scenarios, online learning is able to learn ...
Jorge Cano, Javier(Universitat Politècnica de València, 2022-12-30)
[ES] Over the last decade, the media have undergone a revolution, moving away from conventional television towards on-demand content platforms. Moreover, this revolution has not changed ...
[EN] The cascade approach to Speech Translation (ST) is based on a pipeline that concatenates an Automatic Speech Recognition (ASR) system followed by a Machine Translation (MT) system. Nowadays, state-of-the-art ST systems ...
[EN] The exponential growth of social networks makes the fingerprint
left by users on the Internet a great source of information,
with data about their preferences, needs, goals, profile and
social environment. These data ...
Nowadays, social networks have an enormous impact on
society, generating a lot of useful information that can be employed
in new social applications. In this paper, we show how we
have used a graph-based model to extract ...