- -

Using word graphs as intermediate representation of uttered sentences

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Compartir/Enviar a

Citas

Estadísticas

  • Estadisticas de Uso

Using word graphs as intermediate representation of uttered sentences

Mostrar el registro sencillo del ítem

Ficheros en el ítem

dc.contributor.author Gómez Adrian, Jon Ander es_ES
dc.contributor.author Sanchís Arnal, Emilio es_ES
dc.date.accessioned 2014-03-24T08:07:30Z
dc.date.issued 2012
dc.identifier.isbn 978-3-642-33274-6
dc.identifier.issn 0302-9743
dc.identifier.uri http://hdl.handle.net/10251/36585
dc.description The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-642-33275-3_35 es_ES
dc.description.abstract We present an algorithm for building graphs of words as an intermediate representation of uttered sentences. No language model is used. The input data for the algorithm are the pronunciation lexicon organized as a tree and the sequence of acoustic frames. The transition between consecutive units are considered as additional units. Nodes represent discrete instants of time, arcs are labelled with words, and a confidence measure is assigned to each detected word, which is computed by using the phonetic probabilities of the subsequence of acoustic frames used for completing the word. We evaluated the obtained word graphs by searching the path that best matches with the correct sentence and then measuring the word accuracy, i.e. the oracle word accuracy. © 2012 Springer-Verlag. es_ES
dc.description.sponsorship This work was supported by the Spanish MICINN under contract TIN2011-28169-C05-01 and the Vic. d’Investigació of the UPV under contract 20110897.
dc.format.extent 8 es_ES
dc.language Inglés es_ES
dc.publisher Springer Verlag (Germany) es_ES
dc.relation.ispartof Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications es_ES
dc.relation.ispartofseries Lecture Notes in Computer Science;7441
dc.rights Reserva de todos los derechos es_ES
dc.subject Confidence measures es_ES
dc.subject Lexical tree es_ES
dc.subject Word graphs es_ES
dc.subject Word lattices es_ES
dc.subject.classification LENGUAJES Y SISTEMAS INFORMATICOS es_ES
dc.title Using word graphs as intermediate representation of uttered sentences es_ES
dc.type Capítulo de libro es_ES
dc.embargo.lift 10000-01-01
dc.embargo.terms forever es_ES
dc.identifier.doi 10.1007/978-3-642-33275-3_35
dc.relation.projectID info:eu-repo/grantAgreement/MICINN//TIN2011-28169-C05-01/ES/TIMPANO-UPV: TECNOLOGIAS PARA LA INTERACCION CONVERSACIONAL COMPLAJE PERSONA-MAQUINA CON APRENDIZAJE DINAMICO/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/UPV//20110897/ es_ES
dc.rights.accessRights Abierto es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació es_ES
dc.description.bibliographicCitation Gómez Adrian, JA.; Sanchís Arnal, E. (2012). Using word graphs as intermediate representation of uttered sentences. En Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. Springer Verlag (Germany). 284-291. https://doi.org/10.1007/978-3-642-33275-3_35 es_ES
dc.description.accrualMethod S es_ES
dc.relation.conferencename 17th Iberoamerican Congress, CIARP 2012 es_ES
dc.relation.conferencedate September 3-6, 2012 es_ES
dc.relation.conferenceplace Buenos Aires, Argentina es_ES
dc.relation.publisherversion http://link.springer.com/chapter/10.1007%2F978-3-642-33275-3_35 es_ES
dc.description.upvformatpinicio 284 es_ES
dc.description.upvformatpfin 291 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.relation.senia 231720
dc.contributor.funder Ministerio de Ciencia e Innovación
dc.contributor.funder Universitat Politècnica de València
dc.description.references Ortmanns, S., Ney, H., Aubert, X.: A word graph algorithm for large vocabulary continuous speech recognition. Computer Speech and Language 11, 43–72 (1997) es_ES
dc.description.references Ney, H., Ortmanns, S., Lindam, I.: Extensions to the word graph method for large vocabulary continuous speech recognition. In: Proceedings of IEEE ICASSP 1997, Munich, Germany, vol. 3, pp. 1791–1794 (1997) es_ES
dc.description.references Wessel, F., Schlüter, R., Macherey, K., Ney, H.: Confidence Measures for Large Vocabulary Continuous Speech Recognition. IEEE Transactions on Speech and Audio Processing 9(3), 288–298 (2001) es_ES
dc.description.references Ferreiros, J., San-Segundo, R., Fernández, F., D’Haro, L.-F., Sama, V., Barra, R., Mellén, P.: New word-level and sentence-level confidence scoring using graph theory calculus and its evaluation on speech understanding. In: Proceedings of INTERSPEECH 2005, Lisbon, Portugal, pp. 3377–3380 (2005) es_ES
dc.description.references Raymond, C., Béchet, F., De Mori, R., Damnati, G.: On the use of finite state transducers for semantic interpretation. Speech Communication 48, 288–304 (2006) es_ES
dc.description.references Hakkani-Tür, D., Béchet, F., Riccardi, G., Tur, G.: Beyond ASR 1-best: Using word confusion networks in spoken language understanding. Computer Speech and Language 20, 495–514 (2006) es_ES
dc.description.references Justo, R., Pérez, A., Torres, M.I.: Impact of the Approaches Involved on Word-Graph Derivation from the ASR System. In: Vitrià, J., Sanches, J.M., Hernández, M. (eds.) IbPRIA 2011. LNCS, vol. 6669, pp. 668–675. Springer, Heidelberg (2011) es_ES
dc.description.references Gómez, J.A., Calvo, M.: Improvements on Automatic Speech Segmentation at the Phonetic Level. In: San Martin, C., Kim, S.-W. (eds.) CIARP 2011. LNCS, vol. 7042, pp. 557–564. Springer, Heidelberg (2011) es_ES
dc.description.references Calvo, M., Gómez, J.A., Sanchis, E., Hurtado, L.F.: An algorithm for automatic speech understanding over word graphs. Procesamiento del Lenguaje Natural (48) (accepted, pending of publication, 2012) es_ES
dc.description.references Moreno, A., Poch, D., Bonafonte, A., Lleida, E., Llisterri, J., Mariño, J.B., Nadeu, C.: Albayzin Speech Database: Design of the Phonetic Corpus. In: Proceedings of Eurospeech, Berlin, Germany, vol. 1, pp. 653–656 (September 1993) es_ES
dc.description.references Benedí, J.M., Lleida, E., Varona, A., Castro, M., Galiano, I., Justo, R., López, I., Miguel, A.: Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA. In: Proc. of LREC 2006, Genova, Italy (2006) es_ES


Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem