Buscar en RiuNet

Listar

Todo RiuNet

Mi cuenta

Acceder

Ayuda RiuNet

Admin. UPV

Listar por palabra clave "Speech processing"

Mostrando ítems 1-5 de 5

Continuous lipreading based on acoustic temporal alignments

Gimeno-Gómez, David; Martínez-Hinarejos, Carlos-D. (Springer (Biomed Central Ltd.), 2024-05-06)

[EN] Visual speech recognition (VSR) is a challenging task that has received increasing interest during the last few decades. Current state of the art employs powerful end-to-end architectures based on deep learning which ...
Desarrollo de algoritmos para el aprendizaje automático de palabras en idiomas desconocidos a partir de audio

Gómez Requena, David (Universitat Politècnica de València, 2019-10-04)

[ES] En este trabajo van a ser desarrollados algoritmos que sirven para simular el aprendizaje de las lenguas que realizan los niños. El proceso principal consiste en la búsqueda en discursos largos de audio, de segmentos ...
Estimating the number of segments for improving dialogue act labelling

Tamarit Ballester, Vicent; Martínez-Hinarejos, Carlos-D.; Benedí Ruiz, José Miguel (Cambridge University Press (CUP), 2012-01)

In dialogue systems it is important to label the dialogue turns with dialogue-related meaning. Each turn is usually divided into segments and these segments are labelled with dialogue acts (DAs). A DA is a representation ...
Maximum a Posteriori Binary Mask Estimation for Underdetermined Source Separation Using Smoothed Posteriors

Cobos Serrano, Máximo; López Monfort, José Javier (Institute of Electrical and Electronics Engineers (IEEE), 2012-09)

Sound source separation has become a topic of intensive research in the last years. The research effort has been specially relevant for the underdetermined case, where a considerable number of sparse methods working in the ...
Speaker-Adapted Confidence Measures for ASR using Deep Bidirectional Recurrent Neural Networks

Del Agua Teba, Miguel Angel; Giménez Pastor, Adrián; Sanchis Navarro, José Alberto; Civera Saiz, Jorge; Juan, Alfons (Institute of Electrical and Electronics Engineers, 2018)

[EN] In the last years, Deep Bidirectional Recurrent Neural Networks (DBRNN) and DBRNN with Long Short-Term Memory cells (DBLSTM) have outperformed the most accurate classifiers for confidence estimation in automatic speech ...