- -

The TransLectures-UPV Toolkit

RiuNet: Institutional repository of the Polithecnic University of Valencia

Share/Send to

Cited by

Statistics

The TransLectures-UPV Toolkit

Show simple item record

Files in this item

dc.contributor.author Del Agua Teba, Miguel Angel es_ES
dc.contributor.author Giménez Pastor, Adrián es_ES
dc.contributor.author Serrano Martinez Santos, Nicolas es_ES
dc.contributor.author Andrés Ferrer, Jesús es_ES
dc.contributor.author Civera Saiz, Jorge es_ES
dc.contributor.author Sanchis Navarro, José Alberto es_ES
dc.contributor.author Juan Císcar, Alfonso es_ES
dc.date.accessioned 2015-05-19T10:02:58Z
dc.date.available 2015-05-19T10:02:58Z
dc.date.issued 2014
dc.identifier.isbn 978-3-319-13622-6
dc.identifier.issn 0302-9743
dc.identifier.uri http://hdl.handle.net/10251/50452
dc.description The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-13623-3_28 es_ES
dc.description.abstract Over the past few years, online multimedia educational repositories have increased in number and popularity. The main aim of the transLectures project is to develop cost-effective solutions for producing accurate transcriptions and translations for large video lecture repositories, such as VideoLectures.NET or the Universitat Politècnica de València s repository, poliMedia. In this paper, we present the transLectures-UPV toolkit (TLK), which has been specifically designed to meet the requirements of the transLectures project, but can also be used as a conventional ASR toolkit. The main features of the current release include HMM training and decoding with speaker adaptation techniques (fCMLLR). TLK has been tested on the VideoLectures.NET and poliMedia repositories, yielding very competitive results. TLK has been released under the permissive open source Apache License v2.0 and can be directly downloaded from the transLectures website. es_ES
dc.description.sponsorship The research leading to these results has received funding from the European Union Seventh Framework Programme (FP7/2007-2013) under grant agreement no 287755 (transLectures) and ICT Policy Support Programme (ICT PSP/2007-2013) as part of the Competitiveness and InnovationFramework Programme (CIP) under grant agreement no 621030 (EMMA), andthe Spanish MINECO Active2Trans (TIN2012-31723) research project. es_ES
dc.language Inglés es_ES
dc.publisher Springer International Publishing es_ES
dc.relation EC/CIP/621030 es_ES
dc.relation info:eu-repo/grantAgreement/MINECO//TIN2012-31723/ES/INTERACCION ACTIVA PARA TRANSCRIPCION DE HABLA Y TRADUCCION/ es_ES
dc.relation.ispartof Advances in Speech and Language Technologies for Iberian Languages: Second International Conference, IberSPEECH 2014, Las Palmas de Gran Canaria, Spain, November 19-21, 2014. Proceedings es_ES
dc.relation.ispartofseries Lecture Notes in Computer Science;8854
dc.rights Reserva de todos los derechos es_ES
dc.subject TLK es_ES
dc.subject ASR toolkit es_ES
dc.subject transLectures es_ES
dc.subject HMM es_ES
dc.subject.classification ESTADISTICA E INVESTIGACION OPERATIVA es_ES
dc.subject.classification CIENCIAS DE LA COMPUTACION E INTELIGENCIA ARTIFICIAL es_ES
dc.subject.classification LENGUAJES Y SISTEMAS INFORMATICOS es_ES
dc.title The TransLectures-UPV Toolkit es_ES
dc.type Capítulo de libro es_ES
dc.identifier.doi 10.1007/978-3-319-13623-3_28
dc.relation.projectID info:eu-repo/grantAgreement/EC/FP7/287755/EU/Transcription and Translation of Video Lectures/ es_ES
dc.rights.accessRights Abierto es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Estadística e Investigación Operativa Aplicadas y Calidad - Departament d'Estadística i Investigació Operativa Aplicades i Qualitat es_ES
dc.description.bibliographicCitation Del Agua Teba, MA.; Giménez Pastor, A.; Serrano Martinez Santos, N.; Andrés Ferrer, J.; Civera Saiz, J.; Sanchis Navarro, JA.; Juan Císcar, A. (2014). The TransLectures-UPV Toolkit. En Advances in Speech and Language Technologies for Iberian Languages: Second International Conference, IberSPEECH 2014, Las Palmas de Gran Canaria, Spain, November 19-21, 2014. Proceedings. Springer International Publishing. 269-278. https://doi.org/10.1007/978-3-319-13623-3_28 es_ES
dc.description.accrualMethod S es_ES
dc.relation.publisherversion http://link.springer.com/chapter/10.1007/978-3-319-13623-3_28 es_ES
dc.description.upvformatpinicio 269 es_ES
dc.description.upvformatpfin 278 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.relation.senia 276983
dc.contributor.funder European Commission es_ES
dc.contributor.funder Ministerio de Economía y Competitividad es_ES
dc.description.references Final report on massive adaptation (M36). To be delivered on October 2014 (2014) es_ES
dc.description.references First report on massive adaptation (M12), https://www.translectures.eu/wp-content/uploads/2013/05/transLectures-D3.1.1-18Nov2012.pdf es_ES
dc.description.references Opencast Matterhorn, http://opencast.org/matterhorn/ es_ES
dc.description.references sclite - Score speech recognition system output, http://www1.icsi.berkeley.edu/Speech/docs/sctk-1.2/sclite.htm es_ES
dc.description.references Second report on massive adaptation (M24), https://www.translectures.eu//wp-content/uploads/2014/01/transLectures-D3.1.2-15Nov2013.pdf es_ES
dc.description.references TLK: The transLectures-UPV Toolkit, https://www.translectures.eu/tlk/ es_ES
dc.description.references Baum, L.E., Petrie, T., Soules, G., Weiss, N.: A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains. The Annals of Mathematical Statistics 41(1), 164–171 (1970) es_ES
dc.description.references Dahl, G.E., Yu, D., Deng, L., Acero, A.: Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing 20(1), 30–42 (2012) es_ES
dc.description.references Digalakis, V., Rtischev, D., Neumeyer, L., Sa, E.: Speaker Adaptation Using Constrained Estimation of Gaussian Mixtures. IEEE Transactions on Speech and Audio Processing 3, 357–366 (1995) es_ES
dc.description.references Huang, J.T., Li, J., Yu, D., Deng, L., Gong, Y.: Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers. In: Proc. of ICASSP (2013) es_ES
dc.description.references Munteanu, C., Baecker, R., Penn, G., Toms, E., James, D.: The Effect of Speech Recognition Accuracy Rates on the Usefulness and Usability of Webcast Archives. In: Proc. of CHI, pp. 493–502 (2006) es_ES
dc.description.references Ney, H., Ortmanns, S.: Progress in dynamic programming search for LVCSR. Proceedings of the IEEE 88(8), 1224–1240 (2000) es_ES
dc.description.references Ortmanns, S., Ney, H., Eiden, A.: Language-model look-ahead for large vocabulary speech recognition. In: Proc. of ICSLP, vol. 4, pp. 2095–2098 (1996) es_ES
dc.description.references Ortmanns, S., Ney, H., Aubert, X.: A word graph algorithm for large vocabulary continuous speech recognition. Computer Speech and Language 11(1), 43–72 (1997) es_ES
dc.description.references Povey, D., et al.: The Kaldi Speech Recognition Toolkit. In: Proc. of ASRU (2011) es_ES
dc.description.references Rumelhart, D., Hintont, G., Williams, R.: Learning representations by back-propagating errors. Nature 323(6088), 533–536 (1986) es_ES
dc.description.references Rybach, D., et al.: The RWTH Aachen University Open Source Speech Recognition System. In: Proc. Interspeech, pp. 2111–2114 (2009) es_ES
dc.description.references Seide, F., Li, G., Chen, X., Yu, D.: Feature engineering in Context-Dependent Deep Neural Networks for conversational speech transcription. In: Proc. of ASRU, pp. 24–29 (2011) es_ES
dc.description.references Viterbi, A.: Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Transactions on Information Theory 13(2), 260–269 (1967) es_ES
dc.description.references Young, S., et al.: The HTK Book. Cambridge University Engineering Department (1995) es_ES
dc.description.references Young, S.J., Odell, J.J., Woodland, P.C.: Tree-based state tying for high accuracy acoustic modelling. In: Proc. of HLT, pp. 307–312 (1994) es_ES


This item appears in the following Collection(s)

Show simple item record