Jorge-Cano, Javier; Giménez Pastor, Adrián; Silvestre Cerdà, Joan Albert; Civera Saiz, Jorge; Sanchis Navarro, José Alberto; Juan, Alfons(Institute of Electrical and Electronics Engineers, 2022)
[EN] Although Long Short-Term Memory (LSTM) networks and deep Transformers are now extensively used in offline ASR, it is unclear how best offline systems can be adapted to work with them under the streaming setup. After ...
Pérez González de Martos, Alejandro Manuel; Silvestre Cerdà, Joan Albert; Valor Miró, Juan Daniel; Civera Saiz, Jorge; Juan Císcar, Alfonso(Springer, 2015-09-15)
This paper briefly presents the main features of MLLP's Transcription and Translation Platform, which uses state-of-the-art automatic speech recognition and machine translation systems to generate multilingual subtitles ...
Jorge-Cano, Javier; Giménez Pastor, Adrián; Baquero-Arnal, Pau; Iranzo-Sánchez, Javier; Pérez-González de Martos, Alejandro Manuel; Garcés Díaz-Munío, Gonçal; Silvestre Cerdà, Joan Albert; Civera Saiz, Jorge; Sanchis Navarro, José Alberto; Juan, Alfons(2021-03-25)
[EN] This paper describes the automatic speech recognition (ASR) systems built by the MLLP-VRAIN research group of Universitat Politècnica de València for the Albayzín-RTVE 2020 Speech-to-Text Challenge.
The primary system ...
[EN] This paper describes the automatic speech recognition (ASR) systems built by the MLLP-VRAIN research group of Universitat Politècnica de València for the Albayzín-RTVE 2020 Speech-to-Text Challenge, and includes an ...
Alabau Gonzalvo, Vicente(Universitat Politècnica de València, 2014-01-27)
This thesis presents scientific contributions to the field of multimodal interactive
structured prediction (MISP). The aim of MISP is to reduce the human
effort required to supervise an automatic output, in an efficient ...
Transcription of digitalised historical documents is an interesting task in the document analysis area.
This transcription can be achieved by using Handwritten Text Recognition (HTR) on digitalised pages or by using ...
Alabau, Vicent; Sanchis Navarro, José Alberto; Casacuberta Nolla, Francisco(Elsevier, 2012-12)
[EN] Interactive structured prediction (ISP) is an emerging framework for structured prediction (SP) where the
user and the system collaborate to produce a high quality output. Typically, search algorithms applied
to ISP ...
Rosso, Paolo; Hurtado Oliver, Lluis Felip; Segarra Soriano, Encarnación; Sanchís Arnal, Emilio(Institute of Electrical and Electronics Engineers (IEEE), 2012-01)
[EN] Question answering (QA) is probably one of the most challenging tasks in the field of natural language processing. It requires search engines that are capable of extracting concise, precise fragments of text that ...
Del Agua Teba, Miguel Angel; Giménez Pastor, Adrián; Sanchis Navarro, José Alberto; Civera Saiz, Jorge; Juan, Alfons(Institute of Electrical and Electronics Engineers, 2018)
[EN] In recent years, Deep Bidirectional Recurrent Neural Networks (DBRNN) and DBRNN with Long Short-Term Memory cells (DBLSTM) have outperformed the most accurate classifiers for confidence estimation in automatic speech ...
Jorge Cano, Javier(Universitat Politècnica de València, 2022-12-30)
[ES] Over the last decade, the media have undergone a revolution, moving away from conventional television towards on-demand content platforms. Moreover, this revolution has not changed ...
Del-Agua, Miguel Ángel; Martínez-Villaronga, Adrià; Giménez Pastor, Adrián; Sanchis Navarro, José Alberto; Civera Saiz, Jorge; Juan, Alfons(CHiME, 2016-09-13)
[EN] The MLLP CHiME-4 system is presented in this paper. It has been built using the transLectures-UPV toolkit (TLK) developed by the MLLP research group which makes use of state-of-the-art automatic speech recognition ...
Silvestre Cerdà, Joan Albert; Del Agua Teba, Miguel Angel; Garcés Díaz-Munío, Gonzalo Vicente; Gascó Mora, Guillem; Giménez Pastor, Adrián; Martínez-Villaronga, Adrià Agustí; Pérez González de Martos, Alejandro Manuel; Sánchez-Cortina, Isaías; Serrano Martínez-Santos, Nicolás; Spencer, Rachel Nadine; Valor Miró, Juan Daniel; Andrés Ferrer, Jesús; Civera Saiz, Jorge; Sanchis Navarro, José Alberto; Juan Císcar, Alfonso(IberSPEECH 2012, 2012-11-21)
transLectures (Transcription and Translation of Video Lectures)
is an EU STREP project in which advanced automatic speech
recognition and machine translation techniques are being tested on large
video lecture repositories. ...