- -

Combining Several ASR Outputs in a Graph-Based SLU System

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Compartir/Enviar a

Citas

Estadísticas

  • Estadisticas de Uso

Combining Several ASR Outputs in a Graph-Based SLU System

Mostrar el registro sencillo del ítem

Ficheros en el ítem

dc.contributor.author Calvo Lance, Marcos es_ES
dc.contributor.author Hurtado Oliver, Lluis Felip es_ES
dc.contributor.author García-Granada, Fernando es_ES
dc.contributor.author Sanchís Arnal, Emilio es_ES
dc.date.accessioned 2016-06-24T10:23:20Z
dc.date.available 2016-06-24T10:23:20Z
dc.date.issued 2015-11-09
dc.identifier.isbn 978-3-319-25751-8
dc.identifier.issn 0302-9743
dc.identifier.uri http://hdl.handle.net/10251/66425
dc.description The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-25751-8_66 es_ES
dc.description.abstract In this paper, we present an approach to Spoken Language Understanding (SLU) where we perform a combination of multiple hypotheses from several Automatic Speech Recognizers (ASRs) in order to reduce the impact of recognition errors in the SLU module. This combination is performed using a Grammatical Inference algorithm that provides a generalization of the input sentences by means of a weighted graph of words. We have also developed a specific SLU algorithm that is able to process these graphs of words according to a stochastic semantic modelling.The results show that the combinations of several hypotheses from the ASR module outperform the results obtained by taking just the 1-best transcription es_ES
dc.description.sponsorship This work is partially supported by the Spanish MEC under contract TIN2014-54288-C4-3-R and FPU Grant AP2010-4193. es_ES
dc.format.extent 8 es_ES
dc.language Inglés es_ES
dc.publisher Springer es_ES
dc.relation.ispartof Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications es_ES
dc.relation.ispartofseries Lecture Notes in Computer Science;9423
dc.rights Reserva de todos los derechos es_ES
dc.subject Graph of words es_ES
dc.subject Graph of concepts es_ES
dc.subject Spoken language understanding es_ES
dc.subject.classification LENGUAJES Y SISTEMAS INFORMATICOS es_ES
dc.title Combining Several ASR Outputs in a Graph-Based SLU System es_ES
dc.type Capítulo de libro es_ES
dc.type Comunicación en congreso es_ES
dc.identifier.doi 10.1007/978-3-319-25751-8_66
dc.relation.projectID info:eu-repo/grantAgreement/MINECO//TIN2014-54288-C4-3-R/ES/PROCESADO DE AUDIO, HABLA Y LENGUAJE PARA ANALISIS DE INFORMACION MULTIMEDIA/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/MECD//AP2010-4193/ES/AP2010-4193/ es_ES
dc.rights.accessRights Abierto es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació es_ES
dc.description.bibliographicCitation Calvo Lance, M.; Hurtado Oliver, LF.; García-Granada, F.; Sanchís Arnal, E. (2015). Combining Several ASR Outputs in a Graph-Based SLU System. En Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. Springer. 551-558. https://doi.org/10.1007/978-3-319-25751-8_66 es_ES
dc.description.accrualMethod S es_ES
dc.relation.conferencename 20th Iberoamerican Congress on Pattern Recognition (CIARP-2015) es_ES
dc.relation.conferencedate November 9-12, 2015 es_ES
dc.relation.conferenceplace Montevideo, Uruguay es_ES
dc.relation.publisherversion http://link.springer.com/chapter/10.1007%2F978-3-319-25751-8_66 es_ES
dc.description.upvformatpinicio 551 es_ES
dc.description.upvformatpfin 558 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.relation.senia 297368 es_ES
dc.contributor.funder Ministerio de Economía y Competitividad es_ES
dc.description.references Bangalore, S., Bordel, G., Riccardi, G.: Computing consensus translation from multiple machine translation systems. In: ASRU, pp. 351–354 (2001) es_ES
dc.description.references Benedí, J.M., Lleida, E., Varona, A., Castro, M.J., Galiano, I., Justo, R., de Letona, I.L., Miguel, A.: Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA. In: LREC, pp. 1636–1639 (2006) es_ES
dc.description.references Bonneau-Maynard, H., Lefèvre, F.: Investigating stochastic speech understanding. In: IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pp. 260–263 (2001) es_ES
dc.description.references Calvo, M., García, F., Hurtado, L.F., Jiménez, S., Sanchis, E.: Exploiting multiple hypotheses for multilingual spoken language understanding. In: CoNLL, pp. 193–201 (2013) es_ES
dc.description.references Fiscus, J.G.: A post-processing system to yield reduced word error rates: recognizer output voting error reduction (ROVER). In: 1997 IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 347–354 (1997) es_ES
dc.description.references Hahn, S., Dinarelli, M., Raymond, C., Lefèvre, F., Lehnen, P., De Mori, R., Moschitti, A., Ney, H., Riccardi, G.: Comparing stochastic approaches to spoken language understanding in multiple languages. IEEE Transactions on Audio, Speech, and Language Processing 6(99), 1569–1583 (2010) es_ES
dc.description.references Hakkani-Tür, D., Béchet, F., Riccardi, G., Tür, G.: Beyond ASR 1-best: Using word confusion networks in spoken language understanding. Computer Speech & Language 20(4), 495–514 (2006) es_ES
dc.description.references He, Y., Young, S.: Spoken language understanding using the hidden vector state model. Speech Communication 48, 262–275 (2006) es_ES
dc.description.references Larkin, M.A., Blackshields, G., Brown, N.P., Chenna, R., McGettigan, P.A., McWilliam, H., Valentin, F., Wallace, I.M., Wilm, A., Lopez, R., Thompson, J.D., Gibson, T.J., Higgins, D.G.: ClustalW and ClustalX version 2.0. Bioinformatics 23(21), 2947–2948 (2007) es_ES
dc.description.references Segarra, E., Sanchis, E., Galiano, M., García, F., Hurtado, L.: Extracting Semantic Information Through Automatic Learning Techniques. IJPRAI 16(3), 301–307 (2002) es_ES
dc.description.references Tür, G., Deoras, A., Hakkani-Tür, D.: Semantic parsing using word confusion networks with conditional random fields. In: INTERSPEECH (2013) es_ES


Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem