Exploiting multiple ASR outputs for a spoken language understanding task

Calvo Lance, Marcos; García Granada, Fernando; Hurtado Oliver, Lluis Felip; Jiménez Serrano, Santiago; Sanchís Arnal, Emilio

doi:10.1007/978-3-319-01931-4_19

Identificarse

Buscar en RiuNet

Listar

Todo RiuNet
Esta colección

Mi cuenta

Acceder

Estadísticas

Ver Estadísticas de uso

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

Exploiting multiple ASR outputs for a spoken language understanding task

Mostrar el registro sencillo del ítem

Ficheros en el ítem

Nombre: monolingualSLU_Sp ...

Tamaño: 201.0Kb

Formato: PDF

Descripción: Versión del Autor.

Abrir

Nombre: SPECOM_Preprint.pdf

Tamaño: 205.5Kb

Formato: PDF

Descripción: Versión editorial

Solicitar una copia al autor

dc.contributor.author	Calvo Lance, Marcos	es_ES
dc.contributor.author	García Granada, Fernando	es_ES
dc.contributor.author	Hurtado Oliver, Lluis Felip	es_ES
dc.contributor.author	Jiménez Serrano, Santiago	es_ES
dc.contributor.author	Sanchís Arnal, Emilio	es_ES
dc.date.accessioned	2014-07-04T18:16:56Z
dc.date.issued	2013
dc.identifier.isbn	978-3-319-01930-7
dc.identifier.issn	0302-9743
dc.identifier.uri	http://hdl.handle.net/10251/38611
dc.description	The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-01931-4_19	es_ES
dc.description.abstract	In this paper, we present an approach to Spoken Language Understanding, where the input to the semantic decoding process is a composition of multiple hypotheses provided by the Automatic Speech Recognition module. This way, the semantic constraints can be applied not only to a unique hypothesis, but also to other hypotheses that could represent a better recognition of the utterance. To do this, we have developed an algorithm to combine multiple sentences into a weighted graph of words, which is the input to the semantic decoding process. It has also been necessary to develop a specific algorithm to process these graphs of words according to the statistical models that represent the semantics of the task. This approach has been evaluated in a SLU task in Spanish. Results, considering different configurations of ASR outputs, show the better behavior of the system when a combination of hypotheses is considered.	es_ES
dc.description.sponsorship	This work is partially supported by the Spanish MICINN under contract TIN2011-28169-C05-01, and under FPU Grant AP2010-4193
dc.format.extent	8	es_ES
dc.language	Inglés	es_ES
dc.publisher	Springer Verlag (Germany)	es_ES
dc.relation.ispartof	Speech and Computer	es_ES
dc.relation.ispartofseries	Lecture Notes in Computer Science;
dc.rights	Reserva de todos los derechos	es_ES
dc.subject	Combination of multiple outputs	es_ES
dc.subject	Graph of words	es_ES
dc.subject	Graph of concepts	es_ES
dc.subject	Spoken Language Understanding	es_ES
dc.subject.classification	LENGUAJES Y SISTEMAS INFORMATICOS	es_ES
dc.title	Exploiting multiple ASR outputs for a spoken language understanding task	es_ES
dc.type	Capítulo de libro	es_ES
dc.identifier.doi	10.1007/978-3-319-01931-4_19
dc.relation.projectID	info:eu-repo/grantAgreement/MICINN//TIN2011-28169-C05-01/ES/TIMPANO-UPV: TECNOLOGIAS PARA LA INTERACCION CONVERSACIONAL COMPLAJE PERSONA-MAQUINA CON APRENDIZAJE DINAMICO/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/MECD//AP2010-4193/ES/AP2010-4193/
dc.rights.accessRights	Abierto	es_ES
dc.contributor.affiliation	Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació	es_ES
dc.contributor.affiliation	Universitat Politècnica de València. Instituto Universitario de Aplicaciones de las Tecnologías de la Información - Institut Universitari d'Aplicacions de les Tecnologies de la Informació	es_ES
dc.description.bibliographicCitation	Calvo Lance, M.; García Granada, F.; Hurtado Oliver, LF.; Jiménez Serrano, S.; Sanchís Arnal, E. (2013). Exploiting multiple ASR outputs for a spoken language understanding task. En Speech and Computer. Springer Verlag (Germany). 8113:138-145. https://doi.org/10.1007/978-3-319-01931-4_19	es_ES
dc.description.accrualMethod	S	es_ES
dc.relation.conferencename	15th International Conference, SPECOM 2013	es_ES
dc.relation.conferencedate	September 1-5, 2013	es_ES
dc.relation.conferenceplace	Pilsen, Czech Republic	es_ES
dc.relation.publisherversion	http://link.springer.com/chapter/10.1007/978-3-319-01931-4_19	es_ES
dc.description.upvformatpinicio	138	es_ES
dc.description.upvformatpfin	145	es_ES
dc.type.version	info:eu-repo/semantics/publishedVersion	es_ES
dc.description.volume	8113	es_ES
dc.relation.senia	255963
dc.contributor.funder	Ministerio de Ciencia e Innovación
dc.contributor.funder	Ministerio de Educación, Cultura y Deporte	es_ES
dc.description.references	Tür, G., Mori, R.D.: Spoken Language Understanding: Systems for Extracting Semantic Information from Speech, 1st edn. Wiley (2011)	es_ES
dc.description.references	Fiscus, J.G.: A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER). In: Proceedings of the 1997 IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 347–354. IEEE (1997)	es_ES
dc.description.references	Larkin, M.A., Blackshields, G., Brown, N.P., Chenna, R., McGettigan, P.A., McWilliam, H., Valentin, F., Wallace, I.M., Wilm, A., Lopez, R., Thompson, J.D., Gibson, T.J., Higgins, D.G.: ClustalW and ClustalX version 2.0. Bioinformatics 23, 2947–2948 (2007)	es_ES
dc.description.references	Sim, K.C., Byrne, W.J., Gales, M.J.F., Sahbi, H., Woodland, P.C.: Consensus network decoding for statistical machine translation system combination. In: IEEE Int. Conference on Acoustics, Speech, and Signal Processing (2007)	es_ES
dc.description.references	Bangalore, S., Bordel, G., Riccardi, G.: Computing Consensus Translation from Multiple Machine Translation Systems. In: Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2001), pp. 351–354 (2001)	es_ES
dc.description.references	Calvo, M., Hurtado, L.-F., García, F., Sanchís, E.: A Multilingual SLU System Based on Semantic Decoding of Graphs of Words. In: Torre Toledano, D., Ortega Giménez, A., Teixeira, A., González Rodríguez, J., Hernández Gómez, L., San Segundo Hernández, R., Ramos Castro, D. (eds.) IberSPEECH 2012. CCIS, vol. 328, pp. 158–167. Springer, Heidelberg (2012)	es_ES
dc.description.references	Hakkani-Tür, D., Béchet, F., Riccardi, G., Tür, G.: Beyond ASR 1-best: Using word confusion networks in spoken language understanding. Computer Speech & Language 20, 495–514 (2006)	es_ES
dc.description.references	Benedí, J.M., Lleida, E., Varona, A., Castro, M.J., Galiano, I., Justo, R., López de Letona, I., Miguel, A.: Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA. In: Proceedings of LREC 2006, Genoa, Italy, pp. 1636–1639 (2006)	es_ES

Este ítem aparece en la(s) siguiente(s) colección(ones)

Artículos, conferencias, monografías [46836]

Mostrar el registro sencillo del ítem

Exploiting multiple ASR outputs for a spoken language understanding task

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Buscar en RiuNet

Listar

Todo RiuNet

Esta colección

Mi cuenta

Estadísticas

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

Exploiting multiple ASR outputs for a spoken language understanding task

Ficheros en el ítem

Este ítem aparece en la(s) siguiente(s) colección(ones)