Speaker-Adapted Confidence Measures for ASR using Deep Bidirectional Recurrent Neural Networks

Del Agua Teba, Miguel Angel; Giménez Pastor, Adrián; Sanchis Navarro, José Alberto; Civera Saiz, Jorge; Juan, Alfons

doi:10.1109/TASLP.2018.2819900

Identificarse

Buscar en RiuNet

Listar

Todo RiuNet
Esta colección

Mi cuenta

Acceder

Estadísticas

Ver Estadísticas de uso

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

Speaker-Adapted Confidence Measures for ASR using Deep Bidirectional Recurrent Neural Networks

Mostrar el registro sencillo del ítem

Ficheros en el ítem

Nombre: double.pdf

Tamaño: 622.4Kb

Formato: PDF

Descripción: Versión del Autor.

Abrir

Nombre: editorial.pdf

Tamaño: 978.5Kb

Formato: PDF

Descripción: Versión editorial

Solicitar una copia al autor

dc.contributor.author	Del Agua Teba, Miguel Angel	es_ES
dc.contributor.author	Giménez Pastor, Adrián	es_ES
dc.contributor.author	Sanchis Navarro, José Alberto	es_ES
dc.contributor.author	Civera Saiz, Jorge	es_ES
dc.contributor.author	Juan, Alfons	es_ES
dc.date.accessioned	2019-05-31T20:43:40Z
dc.date.available	2019-05-31T20:43:40Z
dc.date.issued	2018	es_ES
dc.identifier.issn	2329-9290	es_ES
dc.identifier.uri	http://hdl.handle.net/10251/121369
dc.description	© 2018 IEEE. Personal use of this material is permitted. Permissíon from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertisíng or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
dc.description.abstract	[EN] In the last years, Deep Bidirectional Recurrent Neural Networks (DBRNN) and DBRNN with Long Short-Term Memory cells (DBLSTM) have outperformed the most accurate classifiers for confidence estimation in automatic speech recognition. At the same time, we have recently shown that speaker adaptation of confidence measures using DBLSTM yields significant improvements over non-adapted confidence measures. In accordance with these two recent contributions to the state of the art in confidence estimation, this paper presents a comprehensive study of speaker-adapted confidence measures using DBRNN and DBLSTM models. Firstly, we present new empirical evidences of the superiority of RNN-based confidence classifiers evaluated over a large speech corpus consisting of the English LibriSpeech and the Spanish poliMedia tasks. Secondly, we show new results on speaker-adapted confidence measures considering a multi-task framework in which RNN-based confidence classifiers trained with LibriSpeech are adapted to speakers of the TED-LIUM corpus. These experiments confirm that speaker-adapted confidence measures outperform their non-adapted counterparts. Lastly, we describe an unsupervised adaptation method of the acoustic DBLSTM model based on confidence measures which results in better automatic speech recognition performance.	es_ES
dc.description.sponsorship	This work was supported in part by the European Union's Horizon 2020 research and innovation programme under Grant 761758 (X5gon), in part by the Seventh Framework Programme (FP7/2007-2013) under Grant 287755 (transLectures), in part by the ICT Policy Support Programme (ICT PSP/2007-2013) as part of the Competitiveness and Innovation Framework Programme under Grant 621030 (EMMA), and in part by the Spanish Government's TIN2015-68326-R (MINECO/FEDER) research project MORE.	es_ES
dc.language	Inglés	es_ES
dc.publisher	Institute of Electrical and Electronics Engineers	es_ES
dc.relation.ispartof	IEEE/ACM Transactions on Audio Speech and Language Processing	es_ES
dc.rights	Reserva de todos los derechos	es_ES
dc.subject	Automatic speech recognition	es_ES
dc.subject	Confidence estimation	es_ES
dc.subject	Confidence measures	es_ES
dc.subject	Deep bidirectional recurrent neural networks	es_ES
dc.subject	Long short-term memory	es_ES
dc.subject	Speaker adaptation	es_ES
dc.subject	Speech	es_ES
dc.subject	Adaptation models	es_ES
dc.subject	Computer architecture	es_ES
dc.subject	Training	es_ES
dc.subject	Recurrent neural networks	es_ES
dc.subject	Speech processing	es_ES
dc.subject	Task analysis	es_ES
dc.subject.classification	BIBLIOTECONOMIA Y DOCUMENTACION	es_ES
dc.subject.classification	LENGUAJES Y SISTEMAS INFORMATICOS	es_ES
dc.title	Speaker-Adapted Confidence Measures for ASR using Deep Bidirectional Recurrent Neural Networks	es_ES
dc.type	Artículo	es_ES
dc.identifier.doi	10.1109/TASLP.2018.2819900	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/MINECO//TIN2015-68326-R/ES/RECURSOS MULTILINGUES ABIERTOS PARA EDUCACION/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/EC/FP7/287755/EU/Transcription and Translation of Video Lectures/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/EC/H2020/761758/EU/X5gon: Cross Modal, Cross Cultural, Cross Lingual, Cross Domain, and Cross Site Global OER Network/	es_ES
dc.rights.accessRights	Abierto	es_ES
dc.contributor.affiliation	Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació	es_ES
dc.description.bibliographicCitation	Del Agua Teba, MA.; Giménez Pastor, A.; Sanchis Navarro, JA.; Civera Saiz, J.; Juan, A. (2018). Speaker-Adapted Confidence Measures for ASR using Deep Bidirectional Recurrent Neural Networks. IEEE/ACM Transactions on Audio Speech and Language Processing. 26(7):1198-1206. https://doi.org/10.1109/TASLP.2018.2819900	es_ES
dc.description.accrualMethod	S	es_ES
dc.relation.publisherversion	http://doi.org/10.1109/TASLP.2018.2819900	es_ES
dc.description.upvformatpinicio	1198	es_ES
dc.description.upvformatpfin	1206	es_ES
dc.type.version	info:eu-repo/semantics/publishedVersion	es_ES
dc.description.volume	26	es_ES
dc.description.issue	7	es_ES
dc.relation.pasarela	S\356121	es_ES
dc.contributor.funder	European Commission	es_ES
dc.contributor.funder	Ministerio de Economía y Competitividad	es_ES

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem

Speaker-Adapted Confidence Measures for ASR using Deep Bidirectional Recurrent Neural Networks

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Buscar en RiuNet

Listar

Todo RiuNet

Esta colección

Mi cuenta

Estadísticas

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

Speaker-Adapted Confidence Measures for ASR using Deep Bidirectional Recurrent Neural Networks

Ficheros en el ítem

Este ítem aparece en la(s) siguiente(s) colección(ones)