Mostrar el registro sencillo del ítem
dc.contributor.author | Martínez-Villaronga, Adrià | es_ES |
dc.contributor.author | Del Agua Teba, Miguel Angel | es_ES |
dc.contributor.author | Silvestre Cerdà, Joan Albert | es_ES |
dc.contributor.author | Andrés Ferrer, Jesús | es_ES |
dc.contributor.author | Juan, Alfons | es_ES |
dc.date.accessioned | 2015-05-21T18:19:52Z | |
dc.date.available | 2015-05-21T18:19:52Z | |
dc.date.issued | 2014 | |
dc.identifier.issn | 0302-9743 | |
dc.identifier.uri | http://hdl.handle.net/10251/50657 | |
dc.description | The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-13623-3_14 | es_ES |
dc.description.abstract | With the spread of MOOCs and video lecture repositories it is more important than ever to have accurate methods for automatically transcribing video lectures. In this work, we propose a simple yet effective language model adaptation technique based on document retrieval from the web. This technique is combined with slide adaptation, and compared against a strong baseline language model and a stronger slide-adapted baseline. These adaptation techniques are compared within two different acoustic models: a standard HMM model and the CD-DNN-HMM model. The proposed method obtains improvements on WER of up to 14% relative with respect to a competitive baseline as well as outperforming slide adaptation. | es_ES |
dc.description.sponsorship | The research leading to these results has received fund-ing from the European Union Seventh Framework Programme (FP7/2007-2013)under grant agreement no 287755 (transLectures) and ICT Policy Support Pro-gramme (ICT PSP/2007-2013) as part of the Competitiveness and Innovation Framework Programme (CIP) under grant agreement no 621030 (EMMA), the Spanish MINECO Active2Trans (TIN2012-31723) research project and the Spanish Government with the FPU scholarships FPU13/06241 and AP2010-4349. | es_ES |
dc.language | Inglés | es_ES |
dc.publisher | Springer Verlag (Germany) | es_ES |
dc.relation.ispartof | Advances in Speech and Language Technologies for Iberian Languages | es_ES |
dc.relation.ispartofseries | Lecture Notes in Computer Science;8854 | |
dc.rights | Reserva de todos los derechos | es_ES |
dc.subject | Language model adaptation | es_ES |
dc.subject | Video lectures | es_ES |
dc.subject | Document retrieval | es_ES |
dc.subject.classification | ESTADISTICA E INVESTIGACION OPERATIVA | es_ES |
dc.subject.classification | CIENCIAS DE LA COMPUTACION E INTELIGENCIA ARTIFICIAL | es_ES |
dc.subject.classification | LENGUAJES Y SISTEMAS INFORMATICOS | es_ES |
dc.title | Language model adaptation for lecture transcription by document retrieval | es_ES |
dc.type | Capítulo de libro | es_ES |
dc.identifier.doi | 10.1007/978-3-319-13623-3_14 | |
dc.relation.projectID | info:eu-repo/grantAgreement/EC/FP7/287755/EU/Transcription and Translation of Video Lectures/ | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/EC/CIP/621030/EU/European Multiple MOOC Aggregator/EMMA/ | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/MINECO//TIN2012-31723/ES/INTERACCION ACTIVA PARA TRANSCRIPCION DE HABLA Y TRADUCCION/ | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/MECD//FPU13%2F06241/ES/FPU13%2F06241/ | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/MECD//AP2010-4349/ES/AP2010-4349/ | es_ES |
dc.rights.accessRights | Abierto | es_ES |
dc.contributor.affiliation | Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació | es_ES |
dc.contributor.affiliation | Universitat Politècnica de València. Departamento de Estadística e Investigación Operativa Aplicadas y Calidad - Departament d'Estadística i Investigació Operativa Aplicades i Qualitat | es_ES |
dc.description.bibliographicCitation | Martínez-Villaronga, A.; Del Agua Teba, MA.; Silvestre Cerdà, JA.; Andrés Ferrer, J.; Juan, A. (2014). Language model adaptation for lecture transcription by document retrieval. En Advances in Speech and Language Technologies for Iberian Languages. Springer Verlag (Germany). 129-137. https://doi.org/10.1007/978-3-319-13623-3_14 | es_ES |
dc.description.accrualMethod | S | es_ES |
dc.relation.publisherversion | http://dx.doi.org/10.1007/978-3-319-13623-3_14 | es_ES |
dc.description.upvformatpinicio | 129 | es_ES |
dc.description.upvformatpfin | 137 | es_ES |
dc.type.version | info:eu-repo/semantics/publishedVersion | es_ES |
dc.relation.senia | 277410 | |
dc.contributor.funder | European Commission | es_ES |
dc.contributor.funder | Ministerio de Economía y Competitividad | es_ES |
dc.contributor.funder | Ministerio de Educación, Cultura y Deporte | es_ES |
dc.description.references | coursera.org: Take the World’s Best Courses, Online, For Free, http://www.coursera.org/ | es_ES |
dc.description.references | poliMedia: Videolectures from the “Universitat Politècnica de València, http://polimedia.upv.es/catalogo/ | es_ES |
dc.description.references | SuperLectures: We take full care of your event video recordings, http://www.superlectures.com | es_ES |
dc.description.references | transLectures, https://translectures.eu/ | es_ES |
dc.description.references | transLectures-UPV Toolkit (TLK) for Automatic Speech Recognition, http://translectures.eu/tlk | es_ES |
dc.description.references | Udacity: Learn, Think, Do, http://www.udacity.com/ | es_ES |
dc.description.references | Videolectures.NET: Exchange Ideas and Share Knowledge, http://www.videolectures.net/ | es_ES |
dc.description.references | del-Agua, M.A., Giménez, A., Serrano, N., Andrés-Ferrer, J., Civera, J., Sanchis, A., Juan, A.: The translectures-UPV toolkit. In: Navarro Mesa, J.L., Giménez, A.O., Teixeira, A. (eds.) IberSPEECH 2014. LNCS (LNAI), vol. 8854, pp. 269–278. Springer, Heidelberg (2014) | es_ES |
dc.description.references | Chang, P.C., Shan Lee, L.: Improved language model adaptation using existing and derived external resources. In: Proc. of ASRU, pp. 531–536 (2003) | es_ES |
dc.description.references | Chen, S.F., Goodman, J.: An empirical study of smoothing techniques for language modeling. Computer Speech & Language 13(4), 359–393 (1999) | es_ES |
dc.description.references | Jelinek, F., Mercer, R.L.: Interpolated Estimation of Markov Source Parameters from Sparse Data. In: Proc. of the Workshop on Pattern Recognition in Practice, pp. 381–397 (1980) | es_ES |
dc.description.references | Ketterl, M., Schulte, O.A., Hochman, A.: Opencast matterhorn: A community-driven open source solution for creation, management and distribution of audio and video in academia. In: Proc. of ISM, pp. 687–692 (2009) | es_ES |
dc.description.references | Kneser, R., Ney, H.: Improved Backing-off for M-gram Language Modeling. In: Proc. of ICASSP, pp. 181–184 (1995) | es_ES |
dc.description.references | Lecorv, G., Gravier, G., Sbillot, P.: An unsupervised web-based topic language model adaptation method. In: Proc. of ICASSP 2008, pp. 5081–5084 (2008) | es_ES |
dc.description.references | Martínez-Villaronga, A., del Agua, M.A., Andrés-Ferrer, J., Juan, A.: Language model adaptation for video lectures transcription. In: Proc. of ICASSP, pp. 8450–8454 (2013) | es_ES |
dc.description.references | Munteanu, C., Penn, G., Baecker, R.: Web-based language modelling for automatic lecture transcription. In: Proc. of INTERSPEECH, pp. 2353–2356 (2007) | es_ES |
dc.description.references | Rogina, I., Schaaf, T.: Lecture and presentation tracking in an intelligent meeting room. In: Proc of ICMI, pp. 47–52 (2002) | es_ES |
dc.description.references | Schlippe, T., Gren, L., Vu, N.T., Schultz, T.: Unsupervised language model adaptation for automatic speech recognition of broadcast news using web 2.0, pp. 2698–2702 (2013) | es_ES |
dc.description.references | Seide, F., Li, G., Chen, X., Yu, D.: Feature engineering in context-dependent deep neural networks for conversational speech transcription. In: Proc. of ASRU, pp. 24–29 (2011) | es_ES |
dc.description.references | Silvestre, J.A., et al.: Translectures. In: Proc. of IberSPEECH 2012, pp. 345–351 (2012) | es_ES |
dc.description.references | Smith, R.: An overview of the tesseract ocr engine. In: Proc. of ICDAR 2007, pp. 629–633 (2007) | es_ES |
dc.description.references | Stolcke, A.: SRILM – an extensible language modeling toolkit. In: Proc. of ICSLP, pp. 901–904 (2002) | es_ES |
dc.description.references | Tsiartas, A., Georgiou, P., Narayanan, S.: Language model adaptation using www documents obtained by utterance-based queries. In: Proc. of ICASSP, pp. 5406–5409 (2010) | es_ES |