dc.contributor.author | Calvo Lance, Marcos | es_ES |
dc.contributor.author | García Granada, Fernando | es_ES |
dc.contributor.author | Hurtado Oliver, Lluis Felip | es_ES |
dc.contributor.author | Jiménez Serrano, Santiago | es_ES |
dc.contributor.author | Sanchís Arnal, Emilio | es_ES |
dc.date.accessioned | 2014-07-04T18:16:56Z | |
dc.date.issued | 2013 | |
dc.identifier.isbn | 978-3-319-01930-7 | |
dc.identifier.issn | 0302-9743 | |
dc.identifier.uri | http://hdl.handle.net/10251/38611 | |
dc.description | The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-01931-4_19 | es_ES |
dc.description.abstract | In this paper, we present an approach to Spoken Language Understanding, where the input to the semantic decoding process is a composition of multiple hypotheses provided by the Automatic Speech Recognition module. This way, the semantic constraints can be applied not only to a unique hypothesis, but also to other hypotheses that could represent a better recognition of the utterance. To do this, we have developed an algorithm to combine multiple sentences into a weighted graph of words, which is the input to the semantic decoding process. It has also been necessary to develop a specific algorithm to process these graphs of words according to the statistical models that represent the semantics of the task. This approach has been evaluated in a SLU task in Spanish. Results, considering different configurations of ASR outputs, show the better behavior of the system when a combination of hypotheses is considered. | es_ES |
dc.description.sponsorship | This work is partially supported by the Spanish MICINN under contract TIN2011-28169-C05-01, and under FPU Grant AP2010-4193 | |
dc.format.extent | 8 | es_ES |
dc.language | English | es_ES |
dc.publisher | Springer Verlag (Germany) | es_ES |
dc.relation.ispartof | Speech and Computer | es_ES |
dc.relation.ispartofseries | Lecture Notes in Computer Science | |
dc.rights | All rights reserved | es_ES |
dc.subject | Combination of multiple outputs | es_ES |
dc.subject | Graph of words | es_ES |
dc.subject | Graph of concepts | es_ES |
dc.subject | Spoken Language Understanding | es_ES |
dc.subject.classification | LENGUAJES Y SISTEMAS INFORMATICOS | es_ES |
dc.title | Exploiting multiple ASR outputs for a spoken language understanding task | es_ES |
dc.type | Book chapter | es_ES |
dc.identifier.doi | 10.1007/978-3-319-01931-4_19 | |
dc.relation.projectID | info:eu-repo/grantAgreement/MICINN//TIN2011-28169-C05-01/ES/TIMPANO-UPV: TECNOLOGIAS PARA LA INTERACCION CONVERSACIONAL COMPLAJE PERSONA-MAQUINA CON APRENDIZAJE DINAMICO/ | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/MECD//AP2010-4193/ES/AP2010-4193/ | |
dc.rights.accessRights | Open | es_ES |
dc.contributor.affiliation | Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació | es_ES |
dc.contributor.affiliation | Universitat Politècnica de València. Instituto Universitario de Aplicaciones de las Tecnologías de la Información - Institut Universitari d'Aplicacions de les Tecnologies de la Informació | es_ES |
dc.description.bibliographicCitation | Calvo Lance, M.; García Granada, F.; Hurtado Oliver, LF.; Jiménez Serrano, S.; Sanchís Arnal, E. (2013). Exploiting multiple ASR outputs for a spoken language understanding task. In Speech and Computer. Springer Verlag (Germany). 8113:138-145. https://doi.org/10.1007/978-3-319-01931-4_19 | es_ES |
dc.description.accrualMethod | S | es_ES |
dc.relation.conferencename | 15th International Conference, SPECOM 2013 | es_ES |
dc.relation.conferencedate | September 1-5, 2013 | es_ES |
dc.relation.conferenceplace | Pilsen, Czech Republic | es_ES |
dc.relation.publisherversion | http://link.springer.com/chapter/10.1007/978-3-319-01931-4_19 | es_ES |
dc.description.upvformatpinicio | 138 | es_ES |
dc.description.upvformatpfin | 145 | es_ES |
dc.type.version | info:eu-repo/semantics/publishedVersion | es_ES |
dc.description.volume | 8113 | es_ES |
dc.relation.senia | 255963 | |
dc.contributor.funder | Ministerio de Ciencia e Innovación | |
dc.contributor.funder | Ministerio de Educación, Cultura y Deporte | es_ES |
dc.description.references | Tür, G., Mori, R.D.: Spoken Language Understanding: Systems for Extracting Semantic Information from Speech, 1st edn. Wiley (2011) | es_ES |
dc.description.references | Fiscus, J.G.: A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER). In: Proceedings of the 1997 IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 347–354. IEEE (1997) | es_ES |
dc.description.references | Larkin, M.A., Blackshields, G., Brown, N.P., Chenna, R., McGettigan, P.A., McWilliam, H., Valentin, F., Wallace, I.M., Wilm, A., Lopez, R., Thompson, J.D., Gibson, T.J., Higgins, D.G.: ClustalW and ClustalX version 2.0. Bioinformatics 23, 2947–2948 (2007) | es_ES |
dc.description.references | Sim, K.C., Byrne, W.J., Gales, M.J.F., Sahbi, H., Woodland, P.C.: Consensus network decoding for statistical machine translation system combination. In: IEEE Int. Conference on Acoustics, Speech, and Signal Processing (2007) | es_ES |
dc.description.references | Bangalore, S., Bordel, G., Riccardi, G.: Computing Consensus Translation from Multiple Machine Translation Systems. In: Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2001), pp. 351–354 (2001) | es_ES |
dc.description.references | Calvo, M., Hurtado, L.-F., García, F., Sanchís, E.: A Multilingual SLU System Based on Semantic Decoding of Graphs of Words. In: Torre Toledano, D., Ortega Giménez, A., Teixeira, A., González Rodríguez, J., Hernández Gómez, L., San Segundo Hernández, R., Ramos Castro, D. (eds.) IberSPEECH 2012. CCIS, vol. 328, pp. 158–167. Springer, Heidelberg (2012) | es_ES |
dc.description.references | Hakkani-Tür, D., Béchet, F., Riccardi, G., Tür, G.: Beyond ASR 1-best: Using word confusion networks in spoken language understanding. Computer Speech & Language 20, 495–514 (2006) | es_ES |
dc.description.references | Benedí, J.M., Lleida, E., Varona, A., Castro, M.J., Galiano, I., Justo, R., López de Letona, I., Miguel, A.: Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA. In: Proceedings of LREC 2006, Genoa, Italy, pp. 1636–1639 (2006) | es_ES |