- -

Language model adaptation for video lectures transcription

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Compartir/Enviar a

Citas

Estadísticas

  • Estadisticas de Uso

Language model adaptation for video lectures transcription

Mostrar el registro sencillo del ítem

Ficheros en el ítem

dc.contributor.author Martínez-Villaronga, Adrià es_ES
dc.contributor.author Del Agua Teba, Miguel Angel es_ES
dc.contributor.author Andrés Ferrer, Jesús es_ES
dc.contributor.author Juan Císcar, Alfonso es_ES
dc.date.accessioned 2014-09-23T14:16:32Z
dc.date.available 2014-09-23T14:16:32Z
dc.date.issued 2013
dc.identifier.isbn 978-1-4799-0355-9
dc.identifier.uri http://hdl.handle.net/10251/39899
dc.description.abstract Videolectures are currently being digitised all over the world for its enormous value as reference resource. Many of these lectures are accompanied with slides. The slides offer a great opportunity for improving ASR systems performance. We propose a simple yet powerful extension to the linear interpolation of language models for adapting language models with slide information. Two types of slides are considered, correct slides, and slides automatic extracted from the videos with OCR. Furthermore, we compare both time aligned and unaligned slides. Results report an improvement of up to 3.8 % absolute WER points when using correct slides. Surprisingly, when using automatic slides obtained with poor OCR quality, the ASR system still improves up to 2.2 absolute WER points. es_ES
dc.description.sponsorship The research leading to these results has received funding from the European Union Seventh Framework Programme (FP7/2007-2013) under grant agreement no 287755 (transLectures). Also supported by the Spanish Government (Plan E, iTrans2 TIN2009-14511). es_ES
dc.language Inglés es_ES
dc.publisher IInstitute of Electrical and Electronics Engineers (IEEE) es_ES
dc.relation.ispartof Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on es_ES
dc.rights Reserva de todos los derechos es_ES
dc.subject Language model adaptation es_ES
dc.subject Video lectures es_ES
dc.subject.classification ESTADISTICA E INVESTIGACION OPERATIVA es_ES
dc.subject.classification LENGUAJES Y SISTEMAS INFORMATICOS es_ES
dc.title Language model adaptation for video lectures transcription es_ES
dc.type Capítulo de libro es_ES
dc.identifier.doi 10.1109/ICASSP.2013.6639314
dc.relation.projectID info:eu-repo/grantAgreement/MICINN//TIN2009-14511/ES/Traduccion De Textos Y Transcripcion De Voz Interactivas/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/EC/FP7/287755/EU/Transcription and Translation of Video Lectures/ es_ES
dc.rights.accessRights Abierto es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació es_ES
dc.contributor.affiliation Universitat Politècnica de València. Instituto Universitario Mixto Tecnológico de Informática - Institut Universitari Mixt Tecnològic d'Informàtica es_ES
dc.description.bibliographicCitation Martínez-Villaronga, A.; Del Agua Teba, MA.; Andrés Ferrer, J.; Juan Císcar, A. (2013). Language model adaptation for video lectures transcription. En Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. IInstitute of Electrical and Electronics Engineers (IEEE). 8450-8454. https://doi.org/10.1109/ICASSP.2013.6639314 es_ES
dc.description.accrualMethod S es_ES
dc.relation.publisherversion http://dx.doi.org/10.1109/ICASSP.2013.6639314 es_ES
dc.description.upvformatpinicio 8450 es_ES
dc.description.upvformatpfin 8454 es_ES
dc.relation.senia 251572
dc.contributor.funder European Commission es_ES
dc.contributor.funder Ministerio de Ciencia e Innovación es_ES


Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem