Language model adaptation for lecture transcription by document retrieval

Martínez-Villaronga, Adrià; Del Agua Teba, Miguel Angel; Silvestre Cerdà, Joan Albert; Andrés Ferrer, Jesús; Juan, Alfons

doi:10.1007/978-3-319-13623-3_14

Files in this item

Name: ibsp14-cameraReady.pdf

Size: 250.8Kb

Format: PDF

Description: Author's version.

Name: editor-ibsp14.pdf

Size: 130.9Kb

Format: PDF

Description: Publisher's version

dc.contributor.author	Martínez-Villaronga, Adrià	es_ES
dc.contributor.author	Del Agua Teba, Miguel Angel	es_ES
dc.contributor.author	Silvestre Cerdà, Joan Albert	es_ES
dc.contributor.author	Andrés Ferrer, Jesús	es_ES
dc.contributor.author	Juan, Alfons	es_ES
dc.date.accessioned	2015-05-21T18:19:52Z
dc.date.available	2015-05-21T18:19:52Z
dc.date.issued	2014
dc.identifier.issn	0302-9743
dc.identifier.uri	http://hdl.handle.net/10251/50657
dc.description	The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-13623-3_14	es_ES
dc.description.abstract	With the spread of MOOCs and video lecture repositories it is more important than ever to have accurate methods for automatically transcribing video lectures. In this work, we propose a simple yet effective language model adaptation technique based on document retrieval from the web. This technique is combined with slide adaptation, and compared against a strong baseline language model and a stronger slide-adapted baseline. These adaptation techniques are compared within two different acoustic models: a standard HMM model and the CD-DNN-HMM model. The proposed method obtains improvements on WER of up to 14% relative with respect to a competitive baseline as well as outperforming slide adaptation.	es_ES
dc.description.sponsorship	The research leading to these results has received funding from the European Union Seventh Framework Programme (FP7/2007-2013) under grant agreement no 287755 (transLectures) and ICT Policy Support Programme (ICT PSP/2007-2013) as part of the Competitiveness and Innovation Framework Programme (CIP) under grant agreement no 621030 (EMMA), the Spanish MINECO Active2Trans (TIN2012-31723) research project and the Spanish Government with the FPU scholarships FPU13/06241 and AP2010-4349.	es_ES
dc.language	English	es_ES
dc.publisher	Springer Verlag (Germany)	es_ES
dc.relation.ispartof	Advances in Speech and Language Technologies for Iberian Languages	es_ES
dc.relation.ispartofseries	Lecture Notes in Computer Science;8854
dc.rights	All rights reserved	es_ES
dc.subject	Language model adaptation	es_ES
dc.subject	Video lectures	es_ES
dc.subject	Document retrieval	es_ES
dc.subject.classification	Statistics and Operations Research	es_ES
dc.subject.classification	Computer Science and Artificial Intelligence	es_ES
dc.subject.classification	Computer Languages and Systems	es_ES
dc.title	Language model adaptation for lecture transcription by document retrieval	es_ES
dc.type	Book chapter	es_ES
dc.identifier.doi	10.1007/978-3-319-13623-3_14
dc.relation.projectID	info:eu-repo/grantAgreement/EC/FP7/287755/EU/Transcription and Translation of Video Lectures/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/EC/CIP/621030/EU/European Multiple MOOC Aggregator/EMMA/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/MINECO//TIN2012-31723/ES/INTERACCION ACTIVA PARA TRANSCRIPCION DE HABLA Y TRADUCCION/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/MECD//FPU13%2F06241/ES/FPU13%2F06241/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/MECD//AP2010-4349/ES/AP2010-4349/	es_ES
dc.rights.accessRights	Open	es_ES
dc.contributor.affiliation	Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació	es_ES
dc.contributor.affiliation	Universitat Politècnica de València. Departamento de Estadística e Investigación Operativa Aplicadas y Calidad - Departament d'Estadística i Investigació Operativa Aplicades i Qualitat	es_ES
dc.description.bibliographicCitation	Martínez-Villaronga, A.; Del Agua Teba, MA.; Silvestre Cerdà, JA.; Andrés Ferrer, J.; Juan, A. (2014). Language model adaptation for lecture transcription by document retrieval. In Advances in Speech and Language Technologies for Iberian Languages. Springer Verlag (Germany). 129-137. https://doi.org/10.1007/978-3-319-13623-3_14	es_ES
dc.description.accrualMethod	S	es_ES
dc.relation.publisherversion	http://dx.doi.org/10.1007/978-3-319-13623-3_14	es_ES
dc.description.upvformatpinicio	129	es_ES
dc.description.upvformatpfin	137	es_ES
dc.type.version	info:eu-repo/semantics/publishedVersion	es_ES
dc.relation.senia	277410
dc.contributor.funder	European Commission	es_ES
dc.contributor.funder	Ministerio de Economía y Competitividad	es_ES
dc.contributor.funder	Ministerio de Educación, Cultura y Deporte	es_ES
dc.description.references	coursera.org: Take the World’s Best Courses, Online, For Free, http://www.coursera.org/	es_ES
dc.description.references	poliMedia: Videolectures from the “Universitat Politècnica de València”, http://polimedia.upv.es/catalogo/	es_ES
dc.description.references	SuperLectures: We take full care of your event video recordings, http://www.superlectures.com	es_ES
dc.description.references	transLectures, https://translectures.eu/	es_ES
dc.description.references	transLectures-UPV Toolkit (TLK) for Automatic Speech Recognition, http://translectures.eu/tlk	es_ES
dc.description.references	Udacity: Learn, Think, Do, http://www.udacity.com/	es_ES
dc.description.references	Videolectures.NET: Exchange Ideas and Share Knowledge, http://www.videolectures.net/	es_ES
dc.description.references	del-Agua, M.A., Giménez, A., Serrano, N., Andrés-Ferrer, J., Civera, J., Sanchis, A., Juan, A.: The translectures-UPV toolkit. In: Navarro Mesa, J.L., Giménez, A.O., Teixeira, A. (eds.) IberSPEECH 2014. LNCS (LNAI), vol. 8854, pp. 269–278. Springer, Heidelberg (2014)	es_ES
dc.description.references	Chang, P.C., Shan Lee, L.: Improved language model adaptation using existing and derived external resources. In: Proc. of ASRU, pp. 531–536 (2003)	es_ES
dc.description.references	Chen, S.F., Goodman, J.: An empirical study of smoothing techniques for language modeling. Computer Speech & Language 13(4), 359–393 (1999)	es_ES
dc.description.references	Jelinek, F., Mercer, R.L.: Interpolated Estimation of Markov Source Parameters from Sparse Data. In: Proc. of the Workshop on Pattern Recognition in Practice, pp. 381–397 (1980)	es_ES
dc.description.references	Ketterl, M., Schulte, O.A., Hochman, A.: Opencast matterhorn: A community-driven open source solution for creation, management and distribution of audio and video in academia. In: Proc. of ISM, pp. 687–692 (2009)	es_ES
dc.description.references	Kneser, R., Ney, H.: Improved Backing-off for M-gram Language Modeling. In: Proc. of ICASSP, pp. 181–184 (1995)	es_ES
dc.description.references	Lecorvé, G., Gravier, G., Sébillot, P.: An unsupervised web-based topic language model adaptation method. In: Proc. of ICASSP 2008, pp. 5081–5084 (2008)	es_ES
dc.description.references	Martínez-Villaronga, A., del Agua, M.A., Andrés-Ferrer, J., Juan, A.: Language model adaptation for video lectures transcription. In: Proc. of ICASSP, pp. 8450–8454 (2013)	es_ES
dc.description.references	Munteanu, C., Penn, G., Baecker, R.: Web-based language modelling for automatic lecture transcription. In: Proc. of INTERSPEECH, pp. 2353–2356 (2007)	es_ES
dc.description.references	Rogina, I., Schaaf, T.: Lecture and presentation tracking in an intelligent meeting room. In: Proc. of ICMI, pp. 47–52 (2002)	es_ES
dc.description.references	Schlippe, T., Gren, L., Vu, N.T., Schultz, T.: Unsupervised language model adaptation for automatic speech recognition of broadcast news using web 2.0, pp. 2698–2702 (2013)	es_ES
dc.description.references	Seide, F., Li, G., Chen, X., Yu, D.: Feature engineering in context-dependent deep neural networks for conversational speech transcription. In: Proc. of ASRU, pp. 24–29 (2011)	es_ES
dc.description.references	Silvestre, J.A., et al.: Translectures. In: Proc. of IberSPEECH 2012, pp. 345–351 (2012)	es_ES
dc.description.references	Smith, R.: An overview of the Tesseract OCR engine. In: Proc. of ICDAR 2007, pp. 629–633 (2007)	es_ES
dc.description.references	Stolcke, A.: SRILM – an extensible language modeling toolkit. In: Proc. of ICSLP, pp. 901–904 (2002)	es_ES
dc.description.references	Tsiartas, A., Georgiou, P., Narayanan, S.: Language model adaptation using www documents obtained by utterance-based queries. In: Proc. of ICASSP, pp. 5406–5409 (2010)	es_ES
