Gimeno-Gómez, David; Martínez-Hinarejos, Carlos-D. (Springer (Biomed Central Ltd.), 2024-05-06)
[EN] Visual speech recognition (VSR) is a challenging task that has received increasing interest during the last few decades. The current state of the art employs powerful end-to-end architectures based on deep learning which ...
Gómez Requena, David (Universitat Politècnica de València, 2019-10-04)
In this work, algorithms are developed to simulate the way children learn languages. The main process consists of searching long audio recordings of speech for segments ...
Tamarit Ballester, Vicent; Martínez-Hinarejos, Carlos-D.; Benedí Ruiz, José Miguel (Cambridge University Press (CUP), 2012-01)
In dialogue systems it is important to label the dialogue turns with dialogue-related meaning. Each turn is usually divided into segments and these segments are labelled with dialogue acts (DAs). A DA is a representation ...
Cobos Serrano, Máximo; López Monfort, José Javier (Institute of Electrical and Electronics Engineers (IEEE), 2012-09)
Sound source separation has become a topic of intensive research in recent years. The research effort has been especially relevant for the underdetermined case, where a considerable number of sparse methods working in the ...
Del Agua Teba, Miguel Angel; Giménez Pastor, Adrián; Sanchis Navarro, José Alberto; Civera Saiz, Jorge; Juan, Alfons (Institute of Electrical and Electronics Engineers, 2018)
[EN] In recent years, Deep Bidirectional Recurrent Neural Networks (DBRNN) and DBRNN with Long Short-Term Memory cells (DBLSTM) have outperformed the most accurate classifiers for confidence estimation in automatic speech ...