
Exploiting multiple ASR outputs for a spoken language understanding task

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

dc.contributor.author Calvo Lance, Marcos es_ES
dc.contributor.author García Granada, Fernando es_ES
dc.contributor.author Hurtado Oliver, Lluis Felip es_ES
dc.contributor.author Jiménez Serrano, Santiago es_ES
dc.contributor.author Sanchís Arnal, Emilio es_ES
dc.date.accessioned 2014-07-04T18:16:56Z
dc.date.issued 2013
dc.identifier.isbn 978-3-319-01930-7
dc.identifier.issn 0302-9743
dc.identifier.uri http://hdl.handle.net/10251/38611
dc.description The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-01931-4_19 es_ES
dc.description.abstract In this paper, we present an approach to Spoken Language Understanding (SLU) in which the input to the semantic decoding process is a composition of multiple hypotheses provided by the Automatic Speech Recognition (ASR) module. This way, the semantic constraints can be applied not only to a single hypothesis, but also to other hypotheses that could represent a better recognition of the utterance. To do this, we have developed an algorithm to combine multiple sentences into a weighted graph of words, which is the input to the semantic decoding process. It has also been necessary to develop a specific algorithm to process these graphs of words according to the statistical models that represent the semantics of the task. This approach has been evaluated on an SLU task in Spanish. Results, considering different configurations of ASR outputs, show that the system performs better when a combination of hypotheses is considered. es_ES
dc.description.sponsorship This work is partially supported by the Spanish MICINN under contract TIN2011-28169-C05-01, and under FPU Grant AP2010-4193
dc.format.extent 8 es_ES
dc.language English es_ES
dc.publisher Springer Verlag (Germany) es_ES
dc.relation.ispartof Speech and Computer es_ES
dc.relation.ispartofseries Lecture Notes in Computer Science
dc.rights All rights reserved es_ES
dc.subject Combination of multiple outputs es_ES
dc.subject Graph of words es_ES
dc.subject Graph of concepts es_ES
dc.subject Spoken Language Understanding es_ES
dc.subject.classification Computer Languages and Systems (LENGUAJES Y SISTEMAS INFORMATICOS) es_ES
dc.title Exploiting multiple ASR outputs for a spoken language understanding task es_ES
dc.type Book chapter es_ES
dc.identifier.doi 10.1007/978-3-319-01931-4_19
dc.relation.projectID info:eu-repo/grantAgreement/MICINN//TIN2011-28169-C05-01/ES/TIMPANO-UPV: TECNOLOGIAS PARA LA INTERACCION CONVERSACIONAL COMPLAJE PERSONA-MAQUINA CON APRENDIZAJE DINAMICO/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/MECD//AP2010-4193/ES/AP2010-4193/
dc.rights.accessRights Open es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació es_ES
dc.contributor.affiliation Universitat Politècnica de València. Instituto Universitario de Aplicaciones de las Tecnologías de la Información - Institut Universitari d'Aplicacions de les Tecnologies de la Informació es_ES
dc.description.bibliographicCitation Calvo Lance, M.; García Granada, F.; Hurtado Oliver, LF.; Jiménez Serrano, S.; Sanchís Arnal, E. (2013). Exploiting multiple ASR outputs for a spoken language understanding task. In Speech and Computer. Springer Verlag (Germany). 8113:138-145. https://doi.org/10.1007/978-3-319-01931-4_19 es_ES
dc.description.accrualMethod S es_ES
dc.relation.conferencename 15th International Conference, SPECOM 2013 es_ES
dc.relation.conferencedate September 1-5, 2013 es_ES
dc.relation.conferenceplace Pilsen, Czech Republic es_ES
dc.relation.publisherversion http://link.springer.com/chapter/10.1007/978-3-319-01931-4_19 es_ES
dc.description.upvformatpinicio 138 es_ES
dc.description.upvformatpfin 145 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.description.volume 8113 es_ES
dc.relation.senia 255963
dc.contributor.funder Ministerio de Ciencia e Innovación
dc.contributor.funder Ministerio de Educación, Cultura y Deporte es_ES
dc.description.references Tür, G., Mori, R.D.: Spoken Language Understanding: Systems for Extracting Semantic Information from Speech, 1st edn. Wiley (2011) es_ES
dc.description.references Fiscus, J.G.: A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER). In: Proceedings of the 1997 IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 347–354. IEEE (1997) es_ES
dc.description.references Larkin, M.A., Blackshields, G., Brown, N.P., Chenna, R., McGettigan, P.A., McWilliam, H., Valentin, F., Wallace, I.M., Wilm, A., Lopez, R., Thompson, J.D., Gibson, T.J., Higgins, D.G.: ClustalW and ClustalX version 2.0. Bioinformatics 23, 2947–2948 (2007) es_ES
dc.description.references Sim, K.C., Byrne, W.J., Gales, M.J.F., Sahbi, H., Woodland, P.C.: Consensus network decoding for statistical machine translation system combination. In: IEEE Int. Conference on Acoustics, Speech, and Signal Processing (2007) es_ES
dc.description.references Bangalore, S., Bordel, G., Riccardi, G.: Computing Consensus Translation from Multiple Machine Translation Systems. In: Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2001), pp. 351–354 (2001) es_ES
dc.description.references Calvo, M., Hurtado, L.-F., García, F., Sanchís, E.: A Multilingual SLU System Based on Semantic Decoding of Graphs of Words. In: Torre Toledano, D., Ortega Giménez, A., Teixeira, A., González Rodríguez, J., Hernández Gómez, L., San Segundo Hernández, R., Ramos Castro, D. (eds.) IberSPEECH 2012. CCIS, vol. 328, pp. 158–167. Springer, Heidelberg (2012) es_ES
dc.description.references Hakkani-Tür, D., Béchet, F., Riccardi, G., Tür, G.: Beyond ASR 1-best: Using word confusion networks in spoken language understanding. Computer Speech & Language 20, 495–514 (2006) es_ES
dc.description.references Benedí, J.M., Lleida, E., Varona, A., Castro, M.J., Galiano, I., Justo, R., López de Letona, I., Miguel, A.: Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA. In: Proceedings of LREC 2006, Genoa, Italy, pp. 1636–1639 (2006) es_ES

