dc.contributor.author | Martínez-Villaronga, Adrià | es_ES |
dc.contributor.author | Del Agua Teba, Miguel Angel | es_ES |
dc.contributor.author | Andrés Ferrer, Jesús | es_ES |
dc.contributor.author | Juan Císcar, Alfonso | es_ES |
dc.date.accessioned | 2014-09-23T14:16:32Z | |
dc.date.available | 2014-09-23T14:16:32Z | |
dc.date.issued | 2013 | |
dc.identifier.isbn | 978-1-4799-0355-9 | |
dc.identifier.uri | http://hdl.handle.net/10251/39899 | |
dc.description.abstract | Video lectures are currently being digitised all over the world because of their enormous value as a reference resource. Many of these lectures are accompanied by slides, which offer a great opportunity to improve the performance of ASR systems. We propose a simple yet powerful extension to the linear interpolation of language models for adapting language models with slide information. Two types of slides are considered: correct slides, and slides automatically extracted from the videos with OCR. Furthermore, we compare time-aligned and unaligned slides. Results show an improvement of up to 3.8 absolute WER points when using correct slides. Surprisingly, when using automatic slides obtained with poor OCR quality, the ASR system still improves by up to 2.2 absolute WER points. | es_ES |
dc.description.sponsorship | The research leading to these results has received funding from the European Union Seventh Framework Programme (FP7/2007-2013) under grant agreement no. 287755 (transLectures). It was also supported by the Spanish Government (Plan E, iTrans2 TIN2009-14511). | es_ES |
dc.language | English | es_ES |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | es_ES |
dc.relation.ispartof | Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on | es_ES |
dc.rights | All rights reserved | es_ES |
dc.subject | Language model adaptation | es_ES |
dc.subject | Video lectures | es_ES |
dc.subject.classification | STATISTICS AND OPERATIONS RESEARCH | es_ES |
dc.subject.classification | COMPUTER LANGUAGES AND SYSTEMS | es_ES |
dc.title | Language model adaptation for video lectures transcription | es_ES |
dc.type | Book chapter | es_ES |
dc.identifier.doi | 10.1109/ICASSP.2013.6639314 | |
dc.relation.projectID | info:eu-repo/grantAgreement/MICINN//TIN2009-14511/ES/Traduccion De Textos Y Transcripcion De Voz Interactivas/ | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/EC/FP7/287755/EU/Transcription and Translation of Video Lectures/ | es_ES |
dc.rights.accessRights | Open | es_ES |
dc.contributor.affiliation | Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació | es_ES |
dc.contributor.affiliation | Universitat Politècnica de València. Instituto Universitario Mixto Tecnológico de Informática - Institut Universitari Mixt Tecnològic d'Informàtica | es_ES |
dc.description.bibliographicCitation | Martínez-Villaronga, A.; Del Agua Teba, MA.; Andrés Ferrer, J.; Juan Císcar, A. (2013). Language model adaptation for video lectures transcription. In Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. Institute of Electrical and Electronics Engineers (IEEE). 8450-8454. https://doi.org/10.1109/ICASSP.2013.6639314 | es_ES |
dc.description.accrualMethod | S | es_ES |
dc.relation.publisherversion | http://dx.doi.org/10.1109/ICASSP.2013.6639314 | es_ES |
dc.description.upvformatpinicio | 8450 | es_ES |
dc.description.upvformatpfin | 8454 | es_ES |
dc.relation.senia | 251572 | |
dc.contributor.funder | European Commission | es_ES |
dc.contributor.funder | Ministerio de Ciencia e Innovación | es_ES |